Abstract. We consider the bifurcating Markov chain model introduced by Guyon to detect
cellular aging from cell lineage. To take into account the possibility for a cell to die, we use
an underlying Galton-Watson process to describe the evolution of the cell lineage. We give
in this more general framework a weak law of large numbers, an invariance principle and thus
fluctuation results for the average over one generation or up to one generation. We also
prove that the fluctuations over each generation are independent. Then we present the natural
modifications of the tests given by Guyon for cellular aging detection within the particular
case of the auto-regressive model.



1. Introduction 

This work is motivated by experiments done by biologists on Escherichia coli, see Stewart
et al. [16]. E. coli is a rod-shaped single-celled organism which reproduces by dividing in
the middle. It produces a new end per progeny cell. We shall call this new end the new pole
whereas the other end will be called the old pole. The age of a cell is given by the age of
its old pole (i.e. the number of generations in the past of the cell before the old pole was
produced). Notice that at each generation a cell gives birth to 2 cells which each have a new pole:
one of the two cells has an old pole of age 1 (which corresponds to the new pole of its
mother), while the other has an old pole with age larger than one (which corresponds to the
old pole of its mother). The former is called the new pole daughter and the latter the old pole
daughter. Experimental data, see [16], strongly suggest that the growth rate of the new pole
daughter is significantly larger than the growth rate of the old pole daughter. For asymmetric
aging see also Ackermann [2] for another case of asymmetric division, and Lindner et al.
[12] or Ackermann et al. [1] on asymmetric damage repartition.

Guyon [9] studied a mathematical model, called bifurcating Markov chain (BMC), of an
asymmetric Markov chain on a regular binary tree. This model makes it possible to represent an
asymmetric repartition, for example of the growth rate of a cell between new pole and old pole
daughters. Using this model, Guyon provides tests to detect a difference of the growth rate
between new pole and old pole on a single experimental data set, whereas in [16] averages over
many experimental data sets have to be done to detect this difference. In the BMC model,
cells are assumed to never die (a death corresponds to no more division). Indeed few deaths
occur in normal nutriment saturated conditions. However, under stress conditions, dead cells
can represent a significant part of the population. It is therefore natural to take this random
effect into account by using a Galton-Watson (GW) process. Our purpose is to study a model
of bifurcating Markov chains on a Galton-Watson tree instead of a regular tree. Notice that
inferences on symmetric bifurcating processes on regular trees have been studied, see the
survey of Hwang, Basawa and Yeo [11] and the seminal work of Cowan and Staudte [7]. We also
learned of a recent independent work on inferences for asymmetric auto-regressive models by
Bercu, De Saporta and Gegout-Petit [6]. Other models on cell lineage with differentiation
have been investigated, see for example Bansaye [4] on parasite infection and Evans and
Steinsaltz [8] on asymptotic models relying on super-Brownian motion.

1.1. The statistical model. In order to study the behavior of the growth rate of cells
in [16], we set some notations: we index the genealogical tree by the regular binary tree
$\mathbb{T} = \{\emptyset\} \cup \bigcup_{k \in \mathbb{N}^*} \{0,1\}^k$; $\emptyset$ is the label of the ancestor and if $i$ denotes a cell, let $i0$ denote the
new pole progeny cell, and $i1$ the old pole progeny cell. The growth rate of cell $i$ is $X_i$. When
the mother gives birth to two cells among which a unique one divides, we consider that the
cell which does not divide does not grow. We work with the following model (see Section 1.2
for a more general model of BMC on GW tree):

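• With probability $p_{1,0}$, $i$ gives birth to two cells $i0$ and $i1$ which will both divide. The
growth rates of the daughters $X_{i0}$ and $X_{i1}$ are then linked to the mother's one $X_i$
through the following auto-regressive equations

\[
\begin{cases}
X_{i0} = \alpha_0 X_i + \beta_0 + \varepsilon_{i0}, \\
X_{i1} = \alpha_1 X_i + \beta_1 + \varepsilon_{i1},
\end{cases}
\tag{1}
\]

where $\alpha_0, \alpha_1 \in (-1,1)$, $\beta_0, \beta_1 \in \mathbb{R}$ and $((\varepsilon_{i0}, \varepsilon_{i1}), i \in \mathbb{T})$ is a sequence of independent
centered bi-variate Gaussian random variables, with covariance matrix

\[
\sigma^2 \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}, \qquad \sigma^2 > 0, \quad \rho \in (-1,1).
\]

• With probability $p_0$, only the new pole $i0$ divides. Its growth rate $X_{i0}$ is linked to its
mother's one $X_i$ through the relation

\[
X_{i0} = \alpha'_0 X_i + \beta'_0 + \varepsilon'_{i0},
\tag{2}
\]

where $\alpha'_0 \in (-1,1)$, $\beta'_0 \in \mathbb{R}$ and $(\varepsilon'_{i0}, i \in \mathbb{T})$ is a sequence of independent centered
Gaussian random variables with variance $\sigma_0^2 > 0$.

• With probability $p_1$, only the old pole $i1$ divides. Its growth rate $X_{i1}$ is linked to its
mother's one through the relation

\[
X_{i1} = \alpha'_1 X_i + \beta'_1 + \varepsilon'_{i1},
\tag{3}
\]

where $\alpha'_1 \in (-1,1)$, $\beta'_1 \in \mathbb{R}$ and $(\varepsilon'_{i1}, i \in \mathbb{T})$ is a sequence of independent centered
Gaussian random variables with variance $\sigma_1^2 > 0$.

• The sequences $((\varepsilon_{i0}, \varepsilon_{i1}), i \in \mathbb{T})$, $(\varepsilon'_{i0}, i \in \mathbb{T})$ and $(\varepsilon'_{i1}, i \in \mathbb{T})$ are independent.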
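As an illustration only, the model above can be simulated generation by generation. The sketch below is not part of the original analysis: the function name `simulate_lineage`, the encoding of cells as binary strings (with the empty string for the ancestor) and the choice to pass standard deviations rather than variances are assumptions made for this example. With the remaining probability $1 - p_{1,0} - p_0 - p_1$, neither daughter divides.

```python
import random


def simulate_lineage(n_gen, theta, kappa, x0=0.0, seed=0):
    """Simulate growth rates of dividing cells up to generation n_gen.

    theta = (a0, b0, a1, b1, a0p, b0p, a1p, b1p, p10, p0, p1) follows (4);
    kappa = (sigma, rho, sigma0, sigma1) holds standard deviations and rho.
    Returns a dict mapping a cell label (binary string) to its growth rate.
    """
    rng = random.Random(seed)
    a0, b0, a1, b1, a0p, b0p, a1p, b1p, p10, p0, p1 = theta
    sigma, rho, sigma0, sigma1 = kappa
    cells = {"": x0}          # "" labels the ancestor
    current = [""]
    for _ in range(n_gen):
        nxt = []
        for i in current:
            x = cells[i]
            u = rng.random()
            if u < p10:
                # Both daughters divide: correlated Gaussian noise, see (1).
                z0, z1 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
                e0 = sigma * z0
                e1 = sigma * (rho * z0 + (1.0 - rho ** 2) ** 0.5 * z1)
                cells[i + "0"] = a0 * x + b0 + e0
                cells[i + "1"] = a1 * x + b1 + e1
                nxt += [i + "0", i + "1"]
            elif u < p10 + p0:
                # Only the new pole daughter divides, see (2).
                cells[i + "0"] = a0p * x + b0p + rng.gauss(0.0, sigma0)
                nxt.append(i + "0")
            elif u < p10 + p0 + p1:
                # Only the old pole daughter divides, see (3).
                cells[i + "1"] = a1p * x + b1p + rng.gauss(0.0, sigma1)
                nxt.append(i + "1")
            # Otherwise neither daughter divides: the line stops here.
        current = nxt
    return cells
```

Calling `simulate_lineage(10, theta, kappa)` with illustrative parameter values returns one realization of the growth rates on the sub-tree of dividing cells.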

In Section 6, we first compute the maximum likelihood estimator (MLE) of the parameter

\[
\theta = (\alpha_0, \beta_0, \alpha_1, \beta_1, \alpha'_0, \beta'_0, \alpha'_1, \beta'_1, p_{1,0}, p_0, p_1)
\tag{4}
\]

and of $\kappa = (\sigma, \rho, \sigma_0, \sigma_1)$. Then, we prove that they are strongly consistent and that the MLE
of $\theta$ is asymptotically normal, see Proposition 6.3 and Remark 6.4. Notice that the MLE
of $(p_{1,0}, p_0, p_1)$, which is computed only on the underlying GW tree, was already known, see
for example [13]. Finally, we make explicit a test for aging detection, for instance the null
hypothesis $\{(\alpha_0, \beta_0) = (\alpha_1, \beta_1)\}$ against its alternative $\{(\alpha_0, \beta_0) \neq (\alpha_1, \beta_1)\}$, see Proposition
6.7. It appears that, for those hypotheses, using the test statistic from [9] with incomplete
data due to dead cells instead of the test statistic from Proposition 6.7 is not conservative,
see Remark 6.8.

To prove those results, we shall consider a more general framework of BMC which is
described in Section 1.2. An important tool is the auxiliary Markov chain which is defined in
Section 1.3. Finally, easy-to-read versions of our main general results are given in Section 1.4.

1.2. The mathematical model of bifurcating Markov chain (BMC). We first introduce
some notations related to the regular binary tree. Let $\mathbb{G}_0 = \{\emptyset\}$, $\mathbb{G}_k = \{0,1\}^k$ for
$k \in \mathbb{N}^*$, $\mathbb{T}_r = \bigcup_{0 \leq k \leq r} \mathbb{G}_k$. The new (resp. old) pole daughter of a cell $i \in \mathbb{T}$ is denoted by $i0$
(resp. $i1$), and $0$ (resp. $1$) if $i = \emptyset$ is the initial cell or root of the tree. The set $\mathbb{G}_k$ corresponds
to all possible cells in the $k$-th generation. We denote by $|i|$ the generation of $i$ ($|i| = k$ if and
only if $i \in \mathbb{G}_k$).

For a cell $i \in \mathbb{T}$, let $X_i$ denote a quantity of interest (for example its growth rate). We
assume that the quantity of interest of the daughters of a cell $i$, conditionally on the
generations previous to $i$, depends only on $X_i$. This property is stated using the formalism of
BMC. More precisely, let $(E, \mathcal{E})$ be a measurable space, $P$ a probability kernel on $E \times \mathcal{E}^2$
with values in $[0,1]$: $P(\cdot, A)$ is measurable for all $A \in \mathcal{E}^2$ and $P(x, \cdot)$ is a probability measure
on $(E^2, \mathcal{E}^2)$, and for any measurable real-valued bounded function $g$ defined on $E^3$ we set

\[
Pg(x) = \int_{E^2} g(x, y, z) \, P(x, dy, dz).
\]

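Definition 1.1. We say a stochastic process indexed by $\mathbb{T}$, $X = (X_i, i \in \mathbb{T})$, is a bifurcating
Markov chain on a measurable space $(E, \mathcal{E})$ with initial distribution $\nu$ and probability kernel
$P$, a $P$-BMC in short, if:

• $X_\emptyset$ is distributed as $\nu$.

• For any measurable real-valued bounded functions $(g_i, i \in \mathbb{T})$ defined on $E^3$, we have
for all $k \geq 0$,

\[
\mathbb{E}\Big[ \prod_{i \in \mathbb{G}_k} g_i(X_i, X_{i0}, X_{i1}) \,\Big|\, \sigma(X_j;\ j \in \mathbb{T}_k) \Big] = \prod_{i \in \mathbb{G}_k} P g_i(X_i).
\]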

We consider a metric measurable space $(S, \mathcal{S})$ and add a cemetery point $\partial$ to $S$. Let
$\bar{S} = S \cup \{\partial\}$ and $\bar{\mathcal{S}}$ be the $\sigma$-field generated by $\mathcal{S}$ and $\{\partial\}$. (In the biological framework of
the previous Section, $S$ corresponds to the state space of the quantities of interest and $\partial$ is
the default value for dead cells.) Let $P^*$ be a probability kernel defined on $\bar{S} \times \bar{\mathcal{S}}^2$ such that

\[
P^*(\partial, \{(\partial, \partial)\}) = 1.
\tag{5}
\]

Notice that this condition means that $\partial$ is an absorbing state. (In the biological framework
of the previous Section, condition (5) states that no dead cell can give birth to a living cell.)

Definition 1.2. Let $X = (X_i, i \in \mathbb{T})$ be a $P^*$-BMC on $(\bar{S}, \bar{\mathcal{S}})$, with $P^*$ satisfying (5). We
call $(X_i, i \in \mathbb{T}^*)$, with $\mathbb{T}^* = \{i \in \mathbb{T};\ X_i \neq \partial\}$, a bifurcating Markov chain on a Galton-Watson
tree. The $P^*$-BMC is said to be spatially homogeneous if $p_{1,0} = P^*(x, S \times S)$, $p_0 = P^*(x, S \times \{\partial\})$
and $p_1 = P^*(x, \{\partial\} \times S)$ do not depend on $x \in S$. A spatially homogeneous $P^*$-BMC is said to be
super-critical if $m > 1$, where $m = 2p_{1,0} + p_1 + p_0$.

Notice that condition (5) and the spatial homogeneity property imply that $\mathbb{T}^*$ is a GW
tree. This justifies the name of BMC on a Galton-Watson tree. The GW tree is super-critical if
and only if $m > 1$. From now on, we shall only consider super-critical spatially homogeneous
$P^*$-BMC on a Galton-Watson tree. (In the biological framework of the previous Section, $\mathbb{T}^*$
denotes the sub-tree of living cells and the notations $p_{1,0}$, $p_0$ and $p_1$ are consistent since, for
instance, $P^*(x, S \times S)$ represents the probability that a living cell with growth rate $x$ gives
birth to two living cells.)

We now consider the Galton-Watson sub-tree $\mathbb{T}^*$. For any subset $J \subset \mathbb{T}$, let

\[
J^* = J \cap \mathbb{T}^* = \{ j \in J;\ X_j \neq \partial \}
\tag{6}
\]

be the subset of $J$ of living cells and $|J|$ be the cardinal of $J$. The process $Z = (Z_k, k \in \mathbb{N})$,
where $Z_k = |\mathbb{G}_k^*|$, is a GW process with reproduction generating function

\[
\psi(z) = (1 - p_0 - p_1 - p_{1,0}) + (p_0 + p_1) z + p_{1,0} z^2.
\]

Notice the average number of daughters alive is $m$. We have, for $k \geq 0$,

\[
\mathbb{E}[|\mathbb{G}_k^*|] = m^k \quad \text{and} \quad \mathbb{E}[|\mathbb{T}_k^*|] = \sum_{q=0}^{k} \mathbb{E}[|\mathbb{G}_q^*|] = \sum_{q=0}^{k} m^q = \frac{m^{k+1} - 1}{m - 1}.
\tag{7}
\]

Let us recall some well-known facts on super-critical GW, see e.g. [10] or [3]. The extinction
probability of the GW process $Z$ is $\eta = \mathbb{P}(|\mathbb{T}^*| < \infty) = 1 - \dfrac{m - 1}{p_{1,0}}$. There exists a random
variable $W$ s.t.

\[
W = \lim_{q \to \infty} m^{-q} |\mathbb{G}_q^*| \quad \text{a.s. and in } L^2,
\tag{8}
\]

$\mathbb{P}(W = 0) = \eta$ and whose Laplace transform, $\varphi(\lambda) = \mathbb{E}[e^{-\lambda W}]$, satisfies $\varphi(\lambda) = \psi(\varphi(\lambda / m))$ for
$\lambda \geq 0$. Notice the distribution of $W$ is completely characterized by this functional equation
and $\mathbb{E}[W] = 1$.
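The quantities $m$, $\eta$ and $\mathbb{E}[|\mathbb{T}_k^*|]$ are simple functions of $(p_{1,0}, p_0, p_1)$. The short sketch below (the helper names `gw_summary` and `expected_sizes` are hypothetical, introduced only for this example) computes them and lets one check numerically that $\eta$ is a fixed point of the generating function $\psi$.

```python
def gw_summary(p10, p0, p1):
    """Return (m, eta, psi) for the offspring law of the GW tree.

    m is the mean number of living daughters, eta the extinction
    probability (equal to 1 when m <= 1) and psi the generating function.
    """
    q0 = 1.0 - p0 - p1 - p10            # probability of no living daughter
    m = 2.0 * p10 + p0 + p1

    def psi(z):
        return q0 + (p0 + p1) * z + p10 * z * z

    eta = 1.0 - (m - 1.0) / p10 if m > 1.0 else 1.0
    return m, eta, psi


def expected_sizes(m, k):
    """Return ([E|G*_q| for q = 0..k], E|T*_k|), following (7)."""
    gens = [m ** q for q in range(k + 1)]
    return gens, sum(gens)
```

For instance, with the illustrative values $p_{1,0} = 0.6$ and $p_0 = p_1 = 0.15$, one gets $m = 1.5$ and $\eta = 1/6$.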

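For $i \in \mathbb{T}$, we set $\Delta_i = (X_i, X_{i0}, X_{i1})$, the mother-daughters quantities of interest. For a
finite subset $J \subset \mathbb{T}$, we set

\[
M_J(f) =
\begin{cases}
\sum_{i \in J} f(X_i) & \text{for } f \in \mathcal{B}(\bar{S}), \\
\sum_{i \in J} f(\Delta_i) & \text{for } f \in \mathcal{B}(\bar{S}^3),
\end{cases}
\tag{9}
\]

with the convention that $M_\emptyset(f) = 0$, and the following two averages of $f$ over $J$

\[
\bar{M}_J(f) = \frac{1}{|J|} M_J(f) \ \text{ if } |J| > 0 \quad \text{and} \quad \tilde{M}_J(f) = \frac{1}{\mathbb{E}[|J|]} M_J(f) \ \text{ if } \mathbb{E}[|J|] > 0.
\tag{10}
\]

We shall study the asymptotic limit of the averages of a function $f$ for the BMC over the
$n$-th generation, $\bar{M}_{\mathbb{G}_n^*}(f)$ and $\tilde{M}_{\mathbb{G}_n^*}(f)$, or over all the generations up to the $n$-th, $\bar{M}_{\mathbb{T}_n^*}(f)$ and
$\tilde{M}_{\mathbb{T}_n^*}(f)$, as $n$ goes to infinity. Notice the no death case studied in [9] corresponds to $p_{1,0} = 1$,
that is $m = 2$.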

1.3. The auxiliary Markov chain. We define the sub-probability kernel on $S \times \mathcal{S}^2$: $P(\cdot, \cdot) =
P^*(\cdot, \cdot \cap S^2)$ and two sub-probability kernels on $S \times \mathcal{S}$:

\[
P_0^* = P^*(\cdot, (\cdot \cap S) \times \bar{S}) \quad \text{and} \quad P_1^* = P^*(\cdot, \bar{S} \times (\cdot \cap S)).
\]

Notice that $P_0^*$ (resp. $P_1^*$) is the restriction of the first (resp. second) marginal of $P^*$ to $S$.
From spatial homogeneity, we have for all $x \in S$, $P(x, S^2) = p_{1,0}$ and, for $\delta \in \{0, 1\}$,

\[
P_\delta^*(x, \{\partial\}) = 0 \quad \text{and} \quad P_\delta^*(x, S) = p_\delta + p_{1,0}.
\]

We introduce an auxiliary Markov chain (see Guyon [9] for the case $m = 2$). Let $Y = (Y_n, n \in
\mathbb{N})$ be a Markov chain on $S$ with $Y_0$ distributed as $X_\emptyset$ and transition kernel

\[
Q = \frac{1}{m} (P_0^* + P_1^*).
\]

The distribution of $Y_n$ corresponds intuitively to the distribution of $X_I$ conditionally on
$I \in \mathbb{T}^*$, where $I$ is chosen at random in $\mathbb{G}_n$, see Lemma 2.1 for a precise statement. We shall
write $\mathbb{E}_x$ when $X_\emptyset = x$ (i.e. $\nu$ is the Dirac mass at $x \in S$).

Last, we need some more notations: if $(E, \mathcal{E})$ is a metric measurable space, then $\mathcal{B}_b(E)$
(resp. $\mathcal{B}_+(E)$) denotes the set of bounded (resp. non-negative) real-valued measurable
functions defined on $E$. The set $\mathcal{C}_b(E)$ (resp. $\mathcal{C}_+(E)$) denotes the set of bounded (resp.
non-negative) real-valued continuous functions defined on $E$. For a finite measure $\lambda$ on $(E, \mathcal{E})$
and $f \in \mathcal{B}_b(E) \cup \mathcal{B}_+(E)$ we shall write $(\lambda, f)$ for $\int f(x) \, d\lambda(x)$.

We consider the following hypothesis (H):
The Markov chain $Y$ is ergodic, that is there exists a probability measure $\mu$ on $(S, \mathcal{S})$ s.t., for
all $f \in \mathcal{C}_b(S)$ and all $x \in S$, $\lim_{k \to \infty} \mathbb{E}_x[f(Y_k)] = (\mu, f)$.

Notice that under (H), the probability measure $\mu$ is the unique stationary distribution of
$Y$ and $(Y_n, n \in \mathbb{N})$ converges in distribution to $\mu$.

1.4. The main results. We can now state our principal results on the weak law of large
numbers and fluctuations for the averages over a generation or up to a generation. Those
results are a particular case of the more general statements given in Theorem 3.3 and Theorem
5.2, using Remark 2.2.



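Theorem 1.3. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW
tree and $W$ be defined by (8). We assume that (H) holds and that $x \mapsto P^* g(x) \in \mathcal{C}_b(S)$ for
all $g \in \mathcal{C}_b(\bar{S}^3)$. Let $f \in \mathcal{C}_b(\bar{S}^3)$.

• Weak law of large numbers. We have the following convergence in probability:

\[
\mathbf{1}_{\{|\mathbb{G}_q^*| > 0\}} \frac{1}{|\mathbb{G}_q^*|} \sum_{i \in \mathbb{G}_q^*} f(\Delta_i) \xrightarrow[q \to \infty]{\mathbb{P}} (\mu, P^* f) \mathbf{1}_{\{W \neq 0\}},
\]
\[
\mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}} \frac{1}{|\mathbb{T}_r^*|} \sum_{i \in \mathbb{T}_r^*} f(\Delta_i) \xrightarrow[r \to \infty]{\mathbb{P}} (\mu, P^* f) \mathbf{1}_{\{W \neq 0\}}.
\]

• Fluctuations. We have the following convergence in distribution:

\[
\mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}} \frac{1}{\sqrt{|\mathbb{T}_r^*|}} \sum_{i \in \mathbb{T}_r^*} \big( f(\Delta_i) - P^* f(X_i) \big) \xrightarrow[r \to \infty]{(d)} \mathbf{1}_{\{W \neq 0\}} \, \sigma G,
\]

where $\sigma^2 = (\mu, P^*(f^2) - (P^* f)^2)$ and $G$ is a Gaussian random variable with mean
zero, variance 1, and independent of $W$.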

One can get the strong law of large numbers under stronger hypothesis on Y (such as geo- 
metric ergodicity) using similar arguments as in [9]. We also can prove that the fluctuations 
over each generation are asymptotically independent. 

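Theorem 1.4. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW
tree. We assume that (H) holds and that $x \mapsto P^* g(x) \in \mathcal{C}_b(S)$ for all $g \in \mathcal{C}_b(\bar{S}^3)$. Let $d \geq 1$,
and for $\ell \in \{1, \ldots, d\}$, $f_\ell \in \mathcal{C}_b(\bar{S}^3)$ and $\sigma_\ell^2 = (\mu, P^*(f_\ell^2) - (P^* f_\ell)^2)$. We set for $f \in \mathcal{C}_b(\bar{S}^3)$

\[
N_n(f) = \frac{1}{\sqrt{|\mathbb{G}_n^*|}} \sum_{i \in \mathbb{G}_n^*} \big( f(\Delta_i) - P^* f(X_i) \big).
\]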






JEAN-FRANgOIS DELMAS AND LAURENCE MARSALLE 



Then we have the following convergence in distribution:

\[
\big( N_n(f_1), \ldots, N_{n-d+1}(f_d) \big) \mathbf{1}_{\{|\mathbb{G}_n^*| > 0\}} \xrightarrow[n \to \infty]{(d)} \mathbf{1}_{\{W \neq 0\}} (\sigma_1 G_1, \ldots, \sigma_d G_d),
\]

where $G_1, \ldots, G_d$ are independent Gaussian random variables with mean zero and variance 1
and are independent of $W$ given by (8).

Even if the results on fluctuations in Theorem 1.3 are not complete, see Remark 1.5
below, they are still sufficient to study the statistical model we gave in Section 1.1 for the
detection of cellular aging from cell lineage when death of cells can occur.

Remark 1.5. Let $V = (V_r, r \geq 0)$ be a Markov chain on a finite state space. We assume $V$
is irreducible, with transition matrix $R$ and unique invariant probability $\mu$. Then it is well
known, see [14], that $\frac{1}{r} \sum_{i=1}^{r} h(V_i)$ converges a.s. to $(\mu, h)$ and that, to prove the fluctuations
result, one solves the Poisson equation $H - RH = h - (\mu, h)$, writes

\[
\frac{1}{\sqrt{r}} \sum_{i=1}^{r} \big( h(V_i) - (\mu, h) \big) = \frac{1}{\sqrt{r}} \sum_{i=1}^{r} \big( H(V_i) - RH(V_{i-1}) \big) - \frac{1}{\sqrt{r}} RH(V_r) + \frac{1}{\sqrt{r}} RH(V_0),
\tag{11}
\]

and then uses martingale theory (we use similar techniques to prove the fluctuations in
Theorem 1.3) to obtain the asymptotic normality of $\frac{1}{\sqrt{r}} \sum_{i=1}^{r} \big( H(V_i) - RH(V_{i-1}) \big)$. It then
only remains to say that $\frac{1}{\sqrt{r}} RH(V_0)$ and $\frac{1}{\sqrt{r}} RH(V_r)$ converge to 0 to conclude.

Assume that the hypotheses of Theorem 1.3 hold and that $x \mapsto P^*(x, A)$ is continuous for all
$A \in \mathcal{B}(\bar{S}^2)$. Let $h \in \mathcal{C}_b(S)$. Theorem 1.3 implies that $\mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}} \frac{1}{|\mathbb{T}_r^*|} \sum_{i \in \mathbb{T}_r^*} h(X_i)$ converges in
probability to $(\mu, h) \mathbf{1}_{\{W \neq 0\}}$. To get the fluctuations, that is the limit of

\[
\mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}} \frac{1}{\sqrt{|\mathbb{T}_r^*|}} \sum_{i \in \mathbb{T}_r^*} \big( h(X_i) - (\mu, h) \big)
\]

as $r$ goes to infinity, using martingale theory, one can think of using the same kind of approach
in order to use the result on fluctuations of Theorem 1.3. But then notice that what will
correspond to the boundary term in (11) at time $r$, $RH(V_r)$, will now be a boundary term
over the last generation $\mathbb{G}_r^*$, whose cardinal is of the same order as $|\mathbb{T}_r^*|$. Thus the order of
the boundary term is not negligible, which prevents us from concluding.
The fluctuations for $\sum_{i \in \mathbb{T}_r^*} h(X_i)$ are still an open question.

1.5. Organization of the paper. We quickly study the auxiliary chain in Section 2. We
state the first result on the weak law of large numbers in Section 3. Section 4 is devoted to
some preparatory results in order to apply results on fluctuations for martingales. Our main
result, Theorem 5.2, is stated and proved in Section 5. The biological model of Section 1.1 is
analyzed in Section 6.

2. Preliminary result and notations 
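Recall the Markov chain $Y$ defined in Section 1.3.
Lemma 2.1. We have, for $f \in \mathcal{B}_b(S) \cup \mathcal{B}_+(S)$,

\[
\mathbb{E}[f(Y_n)] = m^{-n} \sum_{i \in \mathbb{G}_n} \mathbb{E}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] = \frac{\sum_{i \in \mathbb{G}_n} \mathbb{E}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}]}{\mathbb{E}[|\mathbb{G}_n^*|]} = \mathbb{E}[f(X_I) \mid I \in \mathbb{T}^*],
\tag{12}
\]

where $I$ is a uniform random variable on $\mathbb{G}_n$ independent of $X$.

Proof. We consider the first equality. Recall that $Y_0$ has distribution $\nu$. For $i = i_1 \ldots i_n \in \mathbb{G}_n$,
we have, thanks to (5) and the definition of $P_\delta^*$,

\[
\mathbb{E}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] = (\nu, (P_{i_1}^* \cdots P_{i_n}^*) f),
\]

so that

\[
\sum_{i \in \mathbb{G}_n} \mathbb{E}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] = \sum_{i_1, \ldots, i_n \in \{0,1\}} (\nu, (P_{i_1}^* \cdots P_{i_n}^*) f) = (\nu, (P_0^* + P_1^*)^n f) = m^n (\nu, Q^n f) = m^n \mathbb{E}[f(Y_n)].
\]

This gives the first equality. Then take $f = 1$ in the previous equality to get $m^n =
\sum_{i \in \mathbb{G}_n} \mathbb{P}(i \in \mathbb{T}^*) = \mathbb{E}[|\mathbb{G}_n^*|]$ and the second equality of (12). The last equality of (12) is obvious.

□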
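The first equality of (12) can be checked exactly on a toy example. The sketch below is only an illustration under assumed values: the state space $S = \{0, 1\}$, the probabilities `P10`, `P0`, `P1` and the flip probabilities inside `p_star` are all hypothetical. It computes $\mathbb{E}_x[\sum_{i \in \mathbb{G}_n} f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}]$ by a direct recursion on the two daughters and compares it with $m^n \mathbb{E}_x[f(Y_n)]$, where $Y$ has kernel $Q = (P_0^* + P_1^*)/m$.

```python
from itertools import product

DEAD = None                 # plays the role of the cemetery point
STATES = [0, 1]             # toy state space S (assumption)
P10, P0, P1 = 0.6, 0.15, 0.15
M = 2 * P10 + P0 + P1       # mean number of living daughters


def step(x, y, flip):
    """Toy one-step law: keep the mother's state with probability 1 - flip."""
    return 1.0 - flip if y == x else flip


def p_star(x, y, z):
    """P*(x, {(y, z)}) for a living mother x; y, z may be DEAD."""
    if y is not DEAD and z is not DEAD:
        return P10 * step(x, y, 0.2) * step(x, z, 0.4)
    if y is not DEAD:
        return P0 * step(x, y, 0.3)
    if z is not DEAD:
        return P1 * step(x, z, 0.1)
    return 1.0 - P10 - P0 - P1


def tree_sum(x, n, f):
    """E_x[sum of f(X_i) over living cells i of generation n]."""
    if x is DEAD:
        return 0.0
    if n == 0:
        return f(x)
    total = 0.0
    for y, z in product(STATES + [DEAD], repeat=2):
        total += p_star(x, y, z) * (tree_sum(y, n - 1, f) + tree_sum(z, n - 1, f))
    return total


def q_kernel(x, y):
    """Q(x, {y}) = (P0* + P1*)(x, {y}) / m for living states x, y."""
    p0_star = sum(p_star(x, y, z) for z in STATES + [DEAD])
    p1_star = sum(p_star(x, z, y) for z in STATES + [DEAD])
    return (p0_star + p1_star) / M


def chain_mean(x, n, f):
    """E_x[f(Y_n)] for the auxiliary chain Y with kernel Q."""
    if n == 0:
        return f(x)
    return sum(q_kernel(x, y) * chain_mean(y, n - 1, f) for y in STATES)
```

Under these assumptions, `tree_sum(x, n, f)` and `M ** n * chain_mean(x, n, f)` agree up to rounding for every starting state `x`.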

We recall that $\nu$ denotes the distribution of $X_\emptyset$. Any function $f$ defined on $S$ is extended
to $\bar{S}$ by setting $f(\partial) = 0$. Let $F$ be a vector subspace of $\mathcal{B}(S)$ s.t.

(i) $F$ contains the constants;
(ii) $F^2 := \{f^2;\ f \in F\} \subset F$;
(iii) - $F \otimes F \subset L^1(P(x, \cdot))$ for all $x \in S$ and $P(f_0 \otimes f_1) \in F$ for all $f_0, f_1 \in F$;
      - For $\delta \in \{0, 1\}$, $F \subset L^1(P_\delta^*(x, \cdot))$ for all $x \in S$ and $P_\delta^*(f) \in F$ for all $f \in F$;
(iv) There exists a probability measure $\mu$ on $(S, \mathcal{S})$ s.t. $F \subset L^1(\mu)$ and $\lim_{n \to \infty} \mathbb{E}_x[f(Y_n)] = (\mu, f)$ for all $x \in S$ and $f \in F$;
(v) For all $f \in F$, there exists $g \in F$ s.t. for all $r \in \mathbb{N}$, $|Q^r f| \leq g$;
(vi) $F \subset L^1(\nu)$.

By convention a function defined on $\bar{S}$ is said to belong to $F$ if its restriction to $S$ belongs to
$F$.

Remark 2.2. Notice that if (H) is satisfied and if $x \mapsto P^* g(x)$ is continuous on $S$ for all
$g \in \mathcal{C}_b(\bar{S}^3)$ then the set $\mathcal{C}_b(S)$ fulfills (i)-(vi).



3. Weak law of large numbers 
We give the first result of this section. Recall notations (6) and (9).

Theorem 3.1. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW
tree. Let $F$ satisfy (i)-(vi) and $f \in F$. The sequence $(\tilde{M}_{\mathbb{G}_q^*}(f), q \in \mathbb{N})$ converges to $(\mu, f) W$
in $L^2$, where $W$ is defined by (8). We also have that the sequence $(\bar{M}_{\mathbb{G}_q^*}(f) \mathbf{1}_{\{|\mathbb{G}_q^*| > 0\}}, q \in \mathbb{N})$
converges to $(\mu, f) \mathbf{1}_{\{W \neq 0\}}$ in probability.

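Proof. We first assume that $(\mu, f) = 0$. We have,

\[
\Big\| \sum_{i \in \mathbb{G}_q^*} f(X_i) \Big\|_{L^2}^2 = \mathbb{E}\Big[ \Big( \sum_{i \in \mathbb{G}_q} f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}} \Big)^2 \Big] = \sum_{i \in \mathbb{G}_q} \mathbb{E}[f^2(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] + B_q = m^q \mathbb{E}[f^2(Y_q)] + B_q,
\]

with $B_q = \sum_{(i,j) \in \mathbb{G}_q^2,\ i \neq j} \mathbb{E}[f(X_i) f(X_j) \mathbf{1}_{\{(i,j) \in \mathbb{T}^{*2}\}}]$, where we used (12) for the last equality.

Since the sum in $B_q$ concerns all pairs of distinct elements of $\mathbb{G}_q$, we have that $i \wedge j$, the
most recent common ancestor of $i$ and $j$, does not belong to $\mathbb{G}_q$. We shall compute $B_q$ by
decomposing according to the generation of $k = i \wedge j$: $B_q = \sum_{r=0}^{q-1} \sum_{k \in \mathbb{G}_r} C_k$ with

\[
C_k = \sum_{(i,j) \in \mathbb{G}_q^2,\ i \wedge j = k} \mathbb{E}[f(X_i) f(X_j) \mathbf{1}_{\{(i,j) \in \mathbb{T}^{*2}\}}].
\]

If $|k| = q - 1$, using the Markov property of $X$ and of the GW process at generation $q - 1$,
we get

\[
C_k = \mathbb{E}\Big[ \mathbb{E}_{X_k}\Big[ \sum_{(i,j) \in \mathbb{G}_1^2,\ i \wedge j = \emptyset} f(X_i) f(X_j) \mathbf{1}_{\{(i,j) \in \mathbb{T}^{*2}\}} \Big] \mathbf{1}_{\{k \in \mathbb{T}^*\}} \Big] = 2 \, \mathbb{E}[P(f \otimes f)(X_k) \mathbf{1}_{\{k \in \mathbb{T}^*\}}].
\]

If $|k| < q - 1$, we have, with $r = |k|$,

\[
\begin{aligned}
C_k &= 2 \sum_{(i,j) \in \mathbb{G}_{q-r-1}^2} \mathbb{E}\big[ \mathbb{E}_{X_{k0}}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] \, \mathbb{E}_{X_{k1}}[f(X_j) \mathbf{1}_{\{j \in \mathbb{T}^*\}}] \, \mathbf{1}_{\{k0 \in \mathbb{T}^*,\, k1 \in \mathbb{T}^*\}} \big] \\
&= 2 \, \mathbb{E}\Big[ \sum_{i \in \mathbb{G}_{q-r-1}} \mathbb{E}_{X_{k0}}[f(X_i) \mathbf{1}_{\{i \in \mathbb{T}^*\}}] \sum_{j \in \mathbb{G}_{q-r-1}} \mathbb{E}_{X_{k1}}[f(X_j) \mathbf{1}_{\{j \in \mathbb{T}^*\}}] \, \mathbf{1}_{\{k0 \in \mathbb{T}^*,\, k1 \in \mathbb{T}^*\}} \Big] \\
&= 2 m^{2(q-r-1)} \, \mathbb{E}\big[ \mathbb{E}_{X_{k0}}[f(Y_{q-r-1})] \, \mathbb{E}_{X_{k1}}[f(Y_{q-r-1})] \, \mathbf{1}_{\{k0 \in \mathbb{T}^*,\, k1 \in \mathbb{T}^*\}} \big] \\
&= 2 m^{2(q-r-1)} \, \mathbb{E}\big[ P(Q^{q-r-1} f \otimes Q^{q-r-1} f)(X_k) \mathbf{1}_{\{k \in \mathbb{T}^*\}} \big],
\end{aligned}
\]

where we used the Markov property of $X$ and of the GW process at generation $r + 1$ for the
first equality, (12) for the third equality and the Markov property at generation $r$ for the last
equality.

In particular, we get that $C_k = 2 m^{2(q-r-1)} \mathbb{E}[P(Q^{q-r-1} f \otimes Q^{q-r-1} f)(X_k) \mathbf{1}_{\{k \in \mathbb{T}^*\}}]$ for all $k$
s.t. $|k| \leq q - 1$. Using (12), we deduce that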



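\[
\begin{aligned}
B_q &= 2 \sum_{r=0}^{q-1} m^{2(q-r-1)} \sum_{k \in \mathbb{G}_r} \mathbb{E}\big[ P(Q^{q-r-1} f \otimes Q^{q-r-1} f)(X_k) \mathbf{1}_{\{k \in \mathbb{T}^*\}} \big] \\
&= 2 \sum_{r=0}^{q-1} m^{2q-r-2} \big( \nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f) \big).
\end{aligned}
\]

Therefore, we get

\[
\begin{aligned}
\| \tilde{M}_{\mathbb{G}_q^*}(f) \|_{L^2}^2 &= m^{-2q} \Big\| \sum_{i \in \mathbb{G}_q^*} f(X_i) \Big\|_{L^2}^2 \\
&= m^{-q} \mathbb{E}[f^2(Y_q)] + 2 m^{-2} \sum_{r=0}^{q-1} m^{-r} \big( \nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f) \big).
\end{aligned}
\tag{13}
\]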

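As $f \in F$, properties (ii), (iv), (v) and (vi) imply that $\lim_{q \to \infty} m^{-q} \mathbb{E}[f^2(Y_q)] = 0$. Properties
(iii), (iv) and (v) with $(\mu, f) = 0$ imply that $P(Q^{q-r-1} f \otimes Q^{q-r-1} f)$ converges to 0 as $q$
goes to infinity (with $r$ fixed) and is bounded uniformly in $q > r$ by a function of $F$. Thus,
properties (v) and (vi) imply that $(\nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f))$ converges to 0 as $q$ goes to
infinity (with $r$ fixed) and is bounded uniformly in $q > r$ by a finite constant, say $K$. For
any $\varepsilon > 0$, we can choose $r_0 \geq 1$ s.t. $\sum_{r > r_0} m^{-r} K \leq \varepsilon$ and $q_0 > r_0$ s.t. for $q \geq q_0$ and $r \leq r_0$, we
have $|(\nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f))| \leq \varepsilon / r_0$. We then get that for all $q \geq q_0$

\[
\Big| \sum_{r=0}^{q-1} m^{-r} \big( \nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f) \big) \Big| \leq \sum_{r=0}^{r_0} m^{-r} \frac{\varepsilon}{r_0} + \sum_{r=r_0+1}^{q-1} m^{-r} K \leq 3 \varepsilon.
\]

This gives that $\lim_{q \to \infty} \sum_{r=0}^{q-1} m^{-r} (\nu, Q^r P(Q^{q-r-1} f \otimes Q^{q-r-1} f)) = 0$. Finally, we get from
(13) that if $(\mu, f) = 0$, then $\lim_{q \to +\infty} \| \tilde{M}_{\mathbb{G}_q^*}(f) \|_{L^2} = 0$.

For any function $f \in F$, we have, with $g = f - (\mu, f)$,

\[
\tilde{M}_{\mathbb{G}_q^*}(f) = \tilde{M}_{\mathbb{G}_q^*}(g) + (\mu, f) \, m^{-q} |\mathbb{G}_q^*|.
\]

As $g \in F$ and $(\mu, g) = 0$, the previous computations yield that $\lim_{q \to \infty} \| \tilde{M}_{\mathbb{G}_q^*}(g) \|_{L^2} = 0$. As
$(m^{-q} |\mathbb{G}_q^*|, q \geq 1)$ converges in $L^2$ (and a.s.) to $W$, we get that $\tilde{M}_{\mathbb{G}_q^*}(f)$ converges to $(\mu, f) W$
in $L^2$.

Then use that $m^{-q} |\mathbb{G}_q^*|$ converges a.s. to $W$ to get the second part of the Theorem. □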

We now prove a similar result for the average over the first $r$ generations. We set
$t_r = \mathbb{E}[|\mathbb{T}_r^*|]$, see (7). We first state an elementary Lemma whose proof is left to the reader.

Lemma 3.2. Let $(v_r, r \in \mathbb{N})$ be a sequence of non-negative real numbers converging to $a \in \mathbb{R}_+$,
and $m$ a real such that $m > 1$. Let

\[
w_r = \sum_{q=0}^{r} m^{q-r-1} v_q.
\]

Then the sequence $(w_r, r \in \mathbb{N})$ converges to $a / (m - 1)$.
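Lemma 3.2 is easy to illustrate numerically. The following sketch (the function name `weighted_tail_sum` and the test sequence are chosen only for this example) evaluates $w_r$ for a finite sequence $v_0, \ldots, v_r$; for large $r$ the value should be close to $a / (m - 1)$ because the geometric weights $m^{q-r-1}$ concentrate on the last terms of the sequence.

```python
def weighted_tail_sum(v, m):
    """Return w_r = sum_{q=0}^{r} m^(q - r - 1) * v[q], with r = len(v) - 1."""
    r = len(v) - 1
    return sum(m ** (q - r - 1) * v[q] for q in range(r + 1))


# Example: v_q = 2 + 1/(q + 1) converges to a = 2; with m = 1.5 the
# limit predicted by Lemma 3.2 is a / (m - 1) = 4.
v = [2.0 + 1.0 / (q + 1) for q in range(200)]
w = weighted_tail_sum(v, 1.5)
```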

Recall notations (6) and (9).

Theorem 3.3. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW
tree. Let $F$ satisfy (i)-(vi) and $f \in F$. The sequence $(\tilde{M}_{\mathbb{T}_r^*}(f), r \in \mathbb{N})$ converges to $(\mu, f) W$
in $L^2$, where $W$ is defined by (8). We also have that the sequence $(\bar{M}_{\mathbb{T}_r^*}(f) \mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}}, r \in \mathbb{N})$
converges to $(\mu, f) \mathbf{1}_{\{W \neq 0\}}$ in probability.

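Proof. We have

\[
\begin{aligned}
\big\| \tilde{M}_{\mathbb{T}_r^*}(f) - (\mu, f) W \big\|_{L^2}
&= \Big\| t_r^{-1} \sum_{q=0}^{r} m^q \big( \tilde{M}_{\mathbb{G}_q^*}(f) - (\mu, f) W \big) \Big\|_{L^2} \\
&\leq t_r^{-1} \sum_{q=0}^{r} m^q \big\| \tilde{M}_{\mathbb{G}_q^*}(f) - (\mu, f) W \big\|_{L^2} \\
&= \frac{m - 1}{1 - m^{-r-1}} \sum_{q=0}^{r} m^{q-r-1} \big\| \tilde{M}_{\mathbb{G}_q^*}(f) - (\mu, f) W \big\|_{L^2}.
\end{aligned}
\]

The first part of the Theorem follows from Theorem 3.1 and Lemma 3.2.

Use that $m^{-q} |\mathbb{G}_q^*|$ converges a.s. to $W$ to deduce that $t_r^{-1} |\mathbb{T}_r^*|$ converges a.s. to $W$, and
thus get the second part of the Theorem. □

We end this section with an extension of the results to functions defined on the
mother-daughters quantities of interest $\Delta_i = (X_i, X_{i0}, X_{i1}) \in \bar{S}^3$. Recall notations (6) and (9).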

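Theorem 3.4. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a
GW tree. Let $F$ satisfy (i)-(vi) and $f \in \mathcal{B}(\bar{S}^3)$. We assume that $P^* f$ and $P^*(f^2)$ exist
and belong to $F$. Then the sequences $(\tilde{M}_{\mathbb{G}_q^*}(f), q \in \mathbb{N})$ and $(\tilde{M}_{\mathbb{T}_r^*}(f), r \in \mathbb{N})$ converge to
$(\mu, P^* f) W$ in $L^2$, where $W$ is defined by (8), and the sequences $(\bar{M}_{\mathbb{G}_q^*}(f) \mathbf{1}_{\{|\mathbb{G}_q^*| > 0\}}, q \in \mathbb{N})$ and
$(\bar{M}_{\mathbb{T}_r^*}(f) \mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}}, r \in \mathbb{N})$ converge to $(\mu, P^* f) \mathbf{1}_{\{W \neq 0\}}$ in probability.

Proof. Recall that $M_{\mathbb{G}_q^*}(f) = \sum_{i \in \mathbb{G}_q^*} f(\Delta_i)$. The Markov property for BMC gives

\[
\| M_{\mathbb{G}_q^*}(f) \|_{L^2}^2 = \| M_{\mathbb{G}_q^*}(P^* f) \|_{L^2}^2 + \mathbb{E}\big[ M_{\mathbb{G}_q^*}\big( P^*(f^2) - (P^* f)^2 \big) \big].
\]

Since $(m^{-q} M_{\mathbb{G}_q^*}(P^*(f^2) - (P^* f)^2), q \in \mathbb{N})$ converges to $(\mu, P^*(f^2) - (P^* f)^2) W$ in $L^2$ and thus
in $L^1$, we have that $m^{-2q} \mathbb{E}[M_{\mathbb{G}_q^*}(P^*(f^2) - (P^* f)^2)]$ converges to 0 as $q$ goes to infinity. Then,
we deduce the convergence of $(\tilde{M}_{\mathbb{G}_q^*}(f), q \in \mathbb{N})$ and $(\bar{M}_{\mathbb{G}_q^*}(f) \mathbf{1}_{\{|\mathbb{G}_q^*| > 0\}}, q \in \mathbb{N})$ from Theorem
3.1.

The proof for the convergence of $(\tilde{M}_{\mathbb{T}_r^*}(f), r \in \mathbb{N})$ and $(\bar{M}_{\mathbb{T}_r^*}(f) \mathbf{1}_{\{|\mathbb{G}_r^*| > 0\}}, r \in \mathbb{N})$ then mimics
the proof of Theorem 3.3. □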

4. Technical results about the weak law of large numbers 

The technical Propositions of this section deal with the average of a function $f$ when
going through $\mathbb{T}^*$ via timescales $(\tau_n(t), t \in [0,1])$ preserving the genealogical order. Roughly
speaking, these timescales make it possible to visit the sub-tree $\mathbb{T}^*$. In order to define $(\tau_n(t), t \in [0,1])$
we need to define $I_n^*$, the set of the $n$ "first" cells of $\mathbb{T}^*$. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical
spatially homogeneous $P^*$-BMC on a GW tree and $\mathcal{G}$ be the $\sigma$-field generated by $(X_i, i \in \mathbb{T})$.

• We consider random variables $(\Pi_q^*, q \in \mathbb{N}^*)$ which are, conditionally on $\mathcal{G}$, independent
and s.t. $\Pi_q^*$ is distributed as a uniform random permutation on $\mathbb{G}_q^*$. In particular,
given $|\mathbb{G}_q^*| = k$, $(\Pi_q^*(1), \ldots, \Pi_q^*(k))$ can be viewed as a random drawing of all the
elements of $\mathbb{G}_q^*$, without replacement.

• For each integer $n \in \mathbb{N}^*$, we define the random variable $\rho_n = \inf\{k;\ n \leq |\mathbb{T}_k^*|\}$, with
the convention $\inf \emptyset = \infty$. Loosely speaking, $\rho_n$ is the number of the generation to
which the $n$-th element of $\mathbb{T}^*$ belongs. Notice that $\rho_1 = 0$.

• Let $\Pi$ be the map from $\mathbb{N}^*$ to $\mathbb{T}^* \cup \{\partial_{\mathbb{T}}\}$, where $\partial_{\mathbb{T}}$ is a cemetery point added
to $\mathbb{T}^*$, given by $\Pi(1) = \emptyset$ and for $k \geq 2$:

\[
\Pi(k) =
\begin{cases}
\Pi_{\rho_k}^*\big( k - |\mathbb{T}_{\rho_k - 1}^*| \big) & \text{if } \rho_k < +\infty, \\
\partial_{\mathbb{T}} & \text{if } \rho_k = +\infty.
\end{cases}
\]

Notice that $\Pi$ defines a random order on $\mathbb{T}^*$ which preserves the genealogical order: if
$k \leq n$ then $|\Pi(k)| \leq |\Pi(n)|$, with the convention $|\partial_{\mathbb{T}}| = \infty$. We thus define the set of the $n$
"first" elements of $\mathbb{T}^*$ (when $|\mathbb{T}^*| \geq n$):

\[
I_n^* = \{ \Pi(k),\ 1 \leq k \leq n \wedge |\mathbb{T}^*| \}.
\tag{14}
\]
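The construction of the random order $\Pi$ can be sketched on a finite portion of the tree. In the example below, living cells are encoded as binary strings grouped by generation; this encoding and the function name `random_genealogical_order` are assumptions of the illustration, not notation from the text. Each generation is shuffled uniformly and the generations are then concatenated, so that the first $n$ entries of the returned list play the role of $I_n^*$.

```python
import random


def random_genealogical_order(generations, seed=0):
    """Return the living cells in a random order preserving generations.

    generations[q] is the list of labels of the living cells of generation q
    (generations[0] == [""] for the ancestor). Within each generation the
    cells are drawn uniformly without replacement.
    """
    rng = random.Random(seed)
    order = []
    for cells in generations:
        shuffled = list(cells)
        rng.shuffle(shuffled)      # uniform random permutation of the generation
        order.extend(shuffled)
    return order


# Toy tree: the ancestor, its two daughters, and three living granddaughters.
toy_generations = [[""], ["0", "1"], ["00", "01", "11"]]
toy_order = random_genealogical_order(toy_generations, seed=1)
first_four = toy_order[:4]         # plays the role of I*_4
```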

We can now introduce the timescales: for $n \geq 1$, we consider the subdivision of $[0,1]$
given by $\{0, s_n, \ldots, s_0\}$, with $s_k = m^{-k}$. We define the continuous random time change
$(\tau_n(t), t \in [0,1])$ by

\[
\tau_n(t) =
\begin{cases}
m^n t, & t \in [0, m^{-n}], \\
|\mathbb{T}_{n-k}^*| + (m^k t - 1)(m - 1)^{-1} |\mathbb{G}_{n-k+1}^*|, & t \in [m^{-k}, m^{-k+1}],\ 1 \leq k \leq n.
\end{cases}
\tag{15}
\]

Notice that $\tau_n(t) \leq |\mathbb{T}_n^*|$. The set $I_{\lfloor \tau_n(t) \rfloor}^*$, with $t \in [m^{-n}, 1]$, corresponds to the elements of
$\mathbb{T}_{n-k}^*$, with $k = \left\lfloor -\dfrac{\log(t)}{\log(m)} \right\rfloor + 1$, and the "first" fraction $(m^k t - 1)/(m - 1)$ of the elements of
generation $\mathbb{G}_{n-k+1}^*$.

Recall (9). For the sake of simplicity, for any real $x \geq 0$, we will write $M_x^*(f)$ instead of
$M_{I_{\lfloor x \rfloor}^*}(f)$, with the convention that $M_0^*(f) = 0$.
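The time change (15) can be evaluated directly from the generation sizes. The sketch below is an illustration only: the function name `tau` and the argument `gen_sizes`, holding $|\mathbb{G}_q^*|$ for $q = 0, \ldots, n$, are assumptions made for this example. It locates the index $k$ such that $t \in [m^{-k}, m^{-k+1}]$ and applies the corresponding affine piece.

```python
def tau(n, t, gen_sizes, m):
    """Evaluate the time change tau_n(t) of (15) for t in [0, 1].

    gen_sizes[q] is the number of living cells in generation q, 0 <= q <= n.
    """
    if t <= m ** (-n):
        return m ** n * t
    # Smallest k >= 1 with t >= m^(-k); it satisfies k <= n since t > m^(-n).
    k = 1
    while t < m ** (-k):
        k += 1
    size_up_to = sum(gen_sizes[: n - k + 1])          # |T*_{n-k}|
    return size_up_to + (m ** k * t - 1.0) / (m - 1.0) * gen_sizes[n - k + 1]


# Toy example with n = 3 and generation sizes 1, 2, 3, 4.
sizes = [1, 2, 3, 4]
end_value = tau(3, 1.0, sizes, 1.5)    # equals |T*_3| = 10
```

On this toy example the function is continuous and non-decreasing in `t`, and it reaches the total number of living cells at `t = 1`.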

Proposition 4.1. Let $F$ satisfy (i)-(vi), $f \in F$ and $t \in [0,1]$. The sequence $(m^{-n} M_{\tau_n(t)}^*(f),
n \in \mathbb{N}^*)$ converges to $(\mu, f) m (m - 1)^{-1} W t$ in $L^2$.

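Proof. We first consider the case $(\mu, f) = 0$. If $t = 0$, then $\tau_n(t) = 0$ and $M_0^*(f) = 0$ by
convention. Let $t \in (0,1]$. There exists a unique $k \geq 1$ such that $m^{-k} < t \leq m^{-k+1}$. For
$n \geq k$, we have, using (15) and that $\Pi$ preserves the order on $\mathbb{T}^*$,

\[
M_{\tau_n(t)}^*(f) = M_{\mathbb{T}_{n-k}^*}(f) + M_{J_n}(f),
\]

where $J_n = \{\Pi(i),\ |\mathbb{T}_{n-k}^*| < i \leq \lfloor \tau_n(t) \rfloor\}$. Notice that $J_n = \emptyset$ if $|\mathbb{G}_{n-k+1}^*| = 0$ and that, by
convention, we then have $M_{J_n}(f) = 0$. Theorem 3.3 implies that $m^{-n} M_{\mathbb{T}_{n-k}^*}(f)$ converges to
0 in $L^2$ as $n$ goes to $\infty$. Recall $\mathcal{G}$ is the $\sigma$-field generated by $(X_i, i \in \mathbb{T})$. Since $J_n \subset \mathbb{G}_{n-k+1}^*$,
we have

\[
\mathbb{E}[M_{J_n}(f)^2 \mid \mathcal{G}] = \sum_{i,j \in \mathbb{G}_{n-k+1}} f(X_i) f(X_j) \, \mathbb{E}[\mathbf{1}_{\{i,j \in J_n\}} \mid \mathbb{T}^*].
\]

Thanks to the definition of $\Pi$, we have for $i, j \in \mathbb{G}_{n-k+1}$

\[
\mathbf{1}_{\{i,j \in \mathbb{G}_{n-k+1}^*\}} \mathbb{E}[\mathbf{1}_{\{i,j \in J_n\}} \mid \mathbb{T}^*] = \mathbf{1}_{\{i,j \in \mathbb{G}_{n-k+1}^*\}} \big( \mathbf{1}_{\{i \neq j\}} \chi_2 + \mathbf{1}_{\{i = j\}} \chi_1 \big),
\]

where, with $a = \lfloor (m^k t - 1)(m - 1)^{-1} |\mathbb{G}_{n-k+1}^*| \rfloor$,

\[
\chi_1 = \frac{a}{|\mathbb{G}_{n-k+1}^*|} \quad \text{and} \quad \chi_2 = \frac{a(a - 1)}{|\mathbb{G}_{n-k+1}^*| \, (|\mathbb{G}_{n-k+1}^*| - 1)}.
\]

Thus, we get

\[
\mathbb{E}[M_{J_n}(f)^2 \mid \mathcal{G}] = \chi_2 \sum_{i,j \in \mathbb{G}_{n-k+1}^*} f(X_i) f(X_j) + (\chi_1 - \chi_2) \sum_{i \in \mathbb{G}_{n-k+1}^*} f^2(X_i) \leq M_{\mathbb{G}_{n-k+1}^*}(f)^2 + M_{\mathbb{G}_{n-k+1}^*}(f^2),
\]

as $0 \leq \chi_2 \leq \chi_1 \leq 1$. We have

\[
\| m^{-n} M_{J_n}(f) \|_{L^2}^2 \leq \| m^{-n} M_{\mathbb{G}_{n-k+1}^*}(f) \|_{L^2}^2 + m^{-n} \| m^{-n} M_{\mathbb{G}_{n-k+1}^*}(f^2) \|_{L^1}.
\tag{16}
\]

The first term of the right-hand side of (16) converges to 0 as $n$ goes to infinity, since $(\mu, f) = 0$,
thanks to Theorem 3.1. The same Theorem entails that $\| m^{-n} M_{\mathbb{G}_{n-k+1}^*}(f^2) \|_{L^1}$ converges to
$m^{-k+1} \mathbb{E}[(\mu, f^2) W]$, and consequently the second term of the right-hand side of (16) also converges
to 0 as $n$ goes to infinity. We deduce that the sequence $(m^{-n} M_{J_n}(f), n \in \mathbb{N}^*)$ converges to
0 in $L^2$.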

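Since $m^{-n} M_{\tau_n(t)}^*(f) = m^{-n} M_{\mathbb{T}_{n-k}^*}(f) + m^{-n} M_{J_n}(f)$, the sequence $(m^{-n} M_{\tau_n(t)}^*(f), n \in \mathbb{N}^*)$
converges to 0 in $L^2$.

Next, we consider the case $(\mu, f) \neq 0$. We set $g = f - (\mu, f)$. Since $m^{-n} M_{\tau_n(t)}^*(f) =
m^{-n} M_{\tau_n(t)}^*(g) + (\mu, f) m^{-n} \lfloor \tau_n(t) \rfloor$, the Proposition will be proved as soon as we check that
$(m^{-n} \lfloor \tau_n(t) \rfloor, n \in \mathbb{N}^*)$ converges to $m (m - 1)^{-1} t W$ in $L^2$.

The case $t = 0$ is obvious. For $t \in (0,1]$, there exists a unique $k \geq 1$ such that $m^{-k} < t \leq
m^{-k+1}$. We deduce from (15) that, for $1 \leq k \leq n$,

\[
m^{-n} \tau_n(t) = \frac{m^{-k+1} - m^{-n}}{m - 1} \, \frac{|\mathbb{T}_{n-k}^*|}{t_{n-k}} + \frac{m^k t - 1}{m - 1} \, m^{-k+1} \, \frac{|\mathbb{G}_{n-k+1}^*|}{m^{n-k+1}}.
\]

Since both $m^{-q} |\mathbb{G}_q^*|$ and $t_r^{-1} |\mathbb{T}_r^*|$ converge to $W$ in $L^2$, we finally obtain that $m^{-n} \tau_n(t)$
converges to $m (m - 1)^{-1} t W$ in $L^2$. □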

We deduce from (15) and (7) that, for $t \in (0,1]$ and $n \geq k$, where $k = \lfloor -\log(t) / \log(m) \rfloor + 1$, we have

\[
\mathbb{E}[\tau_n(t)] = t_{n-k} + (m^k t - 1)(m - 1)^{-1} m^{n-k+1} = (m^{n+1} t - 1)(m - 1)^{-1}.
\]

Thus, Proposition 4.1 implies that $(\mathbb{E}[\tau_n(t)]^{-1} M_{\tau_n(t)}^*(f), n \in \mathbb{N}^*)$ converges to $(\mu, f) W$ in
$L^2$ for all $t \in (0,1]$, which generalizes Theorem 3.3.

In fact the convergence in Proposition 4.1 is uniform in $t$.

Corollary 4.2. Let $F$ satisfy (i)-(vi), $f \in F$ s.t. $|f| \in F$. We set $R_n(t) = m^{-n} M_{\tau_n(t)}^*(f) -
(\mu, f) m (m - 1)^{-1} W t$. The sequence $(\sup_{t \in [0,1]} |R_n(t)|, n \in \mathbb{N}^*)$ converges to 0 in $L^2$.

Proof. Let $f \in F$ s.t. $|f| \in F$. We set $f^+ = \max(0, f)$ and $f^- = \max(0, -f)$. As $F$
is a vector space, we get that $f^+ = (f + |f|)/2$ and $f^- = f^+ - f$ belong to $F$. Notice
that $|R_n(t)| \leq |R_n^+(t)| + |R_n^-(t)|$, where $R_n^\delta(t) = m^{-n} M_{\tau_n(t)}^*(f^\delta) - (\mu, f^\delta) m (m - 1)^{-1} W t$ for
$\delta \in \{+, -\}$. So it is enough to prove the Corollary for $f$ non-negative. As $t \mapsto m^{-n} M_{\tau_n(t)}^*(f)$
and $t \mapsto (\mu, f) m (m - 1)^{-1} t W$ are non-decreasing and $R_n(0) = 0$, we get that for $N \geq 1$,

\[
\sup_{t \in [0,1]} |R_n(t)| \leq \frac{1}{N} (\mu, f) m (m - 1)^{-1} W + \sum_{k=1}^{N} |R_n(k/N)|.
\]

Now, use that $W \in L^2$ and that $R_n(t)$ goes to 0 in $L^2$ for all $t \in [0,1]$ to get the result. □

We have a version of Proposition 4.1 and Corollary 4.2 for functions defined on $\bar{S}^3$.

Proposition 4.3. Let $F$ satisfy (i)-(vi), $g \in \mathcal{B}(\bar{S}^3)$ s.t. $P^* g$ and $P^*(g^2)$ exist and belong to
$F$. Let $t \in [0,1]$. The sequence $(m^{-n} M_{\tau_n(t)}^*(g), n \in \mathbb{N}^*)$ converges to $(\mu, P^* g) m (m - 1)^{-1} t W$
in $L^2$.

Furthermore, if $P^* |g|$ and $P^*(g |g|)$ also belong to $F$ then $(\sup_{t \in [0,1]} |R_n(t)|, n \in \mathbb{N}^*)$
converges to 0 in $L^2$, where $R_n(t) = m^{-n} M_{\tau_n(t)}^*(g) - (\mu, P^* g) m (m - 1)^{-1} W t$, for $t \in [0,1]$.

Proof. The proof of the first part is similar to the proof of Theorem 3.4. The proof of the
second part is similar to the proof of Corollary 4.2. □



5. Fluctuations 

Recall (9). For any real $x \geq 0$, using notations from the previous Section, we will write
$M_x^*(f)$ for $M_{I_{\lfloor x \rfloor}^*}(f)$, with the convention that $M_0^*(f) = 0$, where $I_n^*$ is defined by (14). We
shall prove a central limit theorem for the sequence $(M_n^*(f), n \geq 1)$, based on martingale
theorems.

We set $\mathcal{H}_n = \sigma(\Delta_{\Pi(k)}, 1 \leq k \leq n \wedge |\mathbb{T}^*|) \vee \sigma(\Pi(k), 1 \leq k \leq n + 1)$ for $n \geq 1$, $\mathcal{H}_0 =
\sigma(X_\emptyset)$ and $\mathcal{H} = (\mathcal{H}_n, n \in \mathbb{N})$ for the corresponding filtration. With the convention that
$X_{\partial_{\mathbb{T}}} = \partial$, we notice that $X_{\Pi(n+1)}$ is $\mathcal{H}_n$-measurable. Indeed, given $(\Pi(k), 1 \leq k \leq n + 1)$,
if $\Pi(n + 1) \neq \partial_{\mathbb{T}}$, we have $\Pi(n + 1) = \Pi(j) i$ for some $j \in \{1, \ldots, n\}$ and $i \in \{0, 1\}$,
and as $\Delta_{\Pi(j)} = (X_{\Pi(j)}, X_{\Pi(j)0}, X_{\Pi(j)1}) \in \mathcal{H}_n$, we deduce that $X_{\Pi(n+1)}$ is $\mathcal{H}_n$-measurable.
In particular, as $\{|\mathbb{T}^*| \geq n + 1\} \in \mathcal{H}_n$, this implies that $\mathbf{1}_{\{|\mathbb{T}^*| \geq n+1\}} \mathbb{E}[f(\Delta_{\Pi(n+1)}) \mid \mathcal{H}_n] =
\mathbf{1}_{\{|\mathbb{T}^*| \geq n+1\}} P^* f(X_{\Pi(n+1)})$, for any $f \in \mathcal{B}(\bar{S}^3)$ such that $P^* f$ is well defined. If in addition
$P^* f = 0$, then $(M_n^*(f), n \in \mathbb{N})$ is an $\mathcal{H}$-martingale.

We shall first recall a slightly weaker version of Theorem 4.3 from [15] on martingale
convergence. (Theorem 4.3 from [15] is stated for filtrations which may vary with $n$.)

For $u \in \mathbb{R}^d$, we denote by $u'$ its transpose. Let $\mathcal{H} = (\mathcal{H}_i, i \in \mathbb{N})$ be a filtration. If $(D_i, i \in \mathbb{N})$
is a sequence of vector-valued random variables, $\mathcal{H}$-adapted and such that $\mathbb{E}[D_{i+1} \mid \mathcal{H}_i] = 0$ for
all $i \in \mathbb{N}$, then $(D_i, i \in \mathbb{N})$ is called an $\mathcal{H}$-martingale difference.

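Theorem 5.1 (Theorem 4.3 from [15]). Let $\mathcal{H} = (\mathcal{H}_i, i \in \mathbb{N})$ be a filtration. For all $n \in \mathbb{N}^*$,
let $(D_{n,i} = (D_{n,i}^{(1)}, \ldots, D_{n,i}^{(d)}), i \in \mathbb{N})$ be a sequence of $\mathbb{R}^d$-valued random vectors and an
$\mathcal{H}$-martingale difference. For each $n \in \mathbb{N}$, let $(\tau_n(t), t \in [0,1])$ be a non-decreasing càdlàg
function s.t. $\tau_n(t)$ is an $\mathcal{H}$-stopping time for all $t \in [0,1]$. Let $(T(t), t \in [0,1])$ be an $\mathbb{R}^{d \times d}$-valued
continuous, possibly random, function. We assume the following two conditions hold:

(1) Convergence of the timescales. For all $t \in [0,1]$, we have the following convergence
in probability:

\[
\sum_{i=1}^{\tau_n(t)} \mathbb{E}\big[ D_{n,i} (D_{n,i})' \mid \mathcal{H}_{i-1} \big] \xrightarrow[n \to \infty]{\mathbb{P}} T(t).
\]

(2) Lindeberg condition. For all $\varepsilon > 0$, $1 \leq \ell \leq d$, we have the following convergence in
probability:

\[
\sum_{i=1}^{\tau_n(1)} \mathbb{E}\Big[ \big( D_{n,i}^{(\ell)} \big)^2 \mathbf{1}_{\{ |D_{n,i}^{(\ell)}| > \varepsilon \}} \,\Big|\, \mathcal{H}_{i-1} \Big] \xrightarrow[n \to \infty]{\mathbb{P}} 0.
\]

Then $\big( \sum_{i=1}^{\tau_n(\cdot)} D_{n,i}, n \in \mathbb{N}^* \big)$ converges in distribution to $B_T$ in the Skorohod space $\mathbb{D}([0,1])$
of $\mathbb{R}^d$-valued càdlàg functions defined on $[0,1]$, where, conditionally on $T$, $(B_T(t), t \geq 0)$ is a
Gaussian process with independent increments and $B_T(t)$ has zero mean and variance $T(t)$.

Furthermore the convergence is stable: if $(Y_n, n \in \mathbb{N})$ converges in probability to $Y$, then
$\big( \big( \sum_{i=1}^{\tau_n(\cdot)} D_{n,i}, Y_n \big), n \in \mathbb{N} \big)$ converges in distribution to $(B_T, Y)$, where $B_T$ is, conditionally
on $(T, Y)$, distributed as $B_T$ conditionally on $T$, and the distribution of $(T, Y)$ is determined
by the following convergence

\[
\Big( \sum_{i=1}^{\tau_n(\cdot)} \mathbb{E}\big[ D_{n,i} (D_{n,i})' \mid \mathcal{H}_{i-1} \big], \ Y_n \Big) \longrightarrow (T, Y).
\]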

For the sake of simplicity, we will write $P^*h^k$ for $P^*(h^k)$, and if $h = (h_1, \ldots, h_d)$ is an $\mathbb{R}^d$-valued 
function, we will write $P^*h$ for $(P^*h_1, \ldots, P^*h_d)$ and $(\mu, h)$ for $((\mu, h_1), \ldots, (\mu, h_d))$. 

Theorem 5.2. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW 
tree and $\tau_n$ be defined by (15). Let $F$ satisfy (i)-(vi). Let $d \geq 1$, $d' \geq 1$, $f = (f_1, \ldots, f_d) \in 
\mathcal{B}(\bar{S}^3)^d$, $g = (g_1, \ldots, g_{d'}) \in \mathcal{B}(\bar{S}^3)^{d'}$ such that $P^*f_\ell^k$ exist and belong to $F$ for all $1 \leq \ell \leq d$ 
and $1 \leq k \leq 4$, and $P^*g_\ell$, $P^*|g_\ell|$, $P^*g_\ell^2$ and $P^*g_\ell|g_\ell|$ exist and belong to $F$ for all $1 \leq \ell \leq d'$. Let 
$\Sigma$ be a square root of the symmetric positive matrix $m(m-1)^{-1} (\mu, P^*(ff') - (P^*f)(P^*f)')$ 
and $\gamma = m(m-1)^{-1} (\mu, P^*g)$. 

Then, the sequence $(m^{-n/2} M^*_{\tau_n(\cdot)}(f - P^*f), m^{-n} M^*_{\tau_n(\cdot)}(g))$ converges in distribution in the 
Skorohod space $\mathbb{D}([0,1], \mathbb{R}^{d+d'})$ of $\mathbb{R}^{d+d'}$-valued càdlàg functions defined on $[0,1]$, to the process 
$(\Sigma \sqrt{W} B, \gamma W h_0)$, where $B$ is a $d$-dimensional Brownian motion independent of $W$, defined 
by (8), and $h_0$ is the identity function $t \mapsto t$. 

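Proof. Notice that $\tau_n$ defined by (15) is a non-decreasing continuous function s.t. $\tau_n(t)$ is an 
$\mathcal{H}$-stopping time for all $t \in [0,1]$. We set for all $n, i \in \mathbb{N}^*$, 

\[ D_{n,i} = m^{-n/2} \left( f(\Delta_{n(i)}) - P^*f(X_{n(i)}) \right) 1_{\{i \leq |\mathbb{T}^*|\}}, \]

so that $(D_{n,i}, i \in \mathbb{N})$ is an $\mathcal{H}$-martingale difference. Notice the matrix $(\mu, P^*(ff') - (P^*f)(P^*f)')$ 
is indeed symmetric and positive, so that $\Sigma$ is well defined. 
Notice that 

\[ E\left[ D_{n,i} (D_{n,i})' \mid \mathcal{H}_{i-1} \right] = m^{-n} \left( P^*(ff')(X_{n(i)}) - (P^*f)(X_{n(i)}) (P^*f)'(X_{n(i)}) \right) 1_{\{i \leq |\mathbb{T}^*|\}}. \]

The convergence of the timescales condition of Theorem 5.1, with $T(t) = \Sigma^2 W t$, is then a 
direct application of Proposition 4.1. 
For $1 \leq \ell \leq d$, we have 

\[ E\left[ (D_{n,i}^{(\ell)})^2 \, 1_{\{|D_{n,i}^{(\ell)}| > \varepsilon\}} \mid \mathcal{H}_{i-1} \right] \leq \varepsilon^{-2} E\left[ (D_{n,i}^{(\ell)})^4 \mid \mathcal{H}_{i-1} \right] = \varepsilon^{-2} m^{-2n} P^*(f_\ell - P^*f_\ell)^4 (X_{n(i)}) 1_{\{i \leq |\mathbb{T}^*|\}}. \]

The Lindeberg condition of Theorem 5.1 is then a direct application of Proposition 4.1. 

Notice the second part of Proposition 4.3 implies the convergence of $Y_n = m^{-n} M^*_{\tau_n(\cdot)}(g)$ 
to $\gamma W h_0$ in probability in the Skorohod space. We then deduce the result from Theorem 
5.1. □ 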

The following result is an immediate consequence of Theorem 5.2. 

Corollary 5.3. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW 
tree. Let $F$ satisfy (i)-(vi). Let $f \in \mathcal{B}(\bar{S}^3)$ such that $P^*f^k$ exists and belongs to $F$ for all 
$1 \leq k \leq 4$. Let $\sigma^2 = (\mu, P^*f^2 - (P^*f)^2)$. 

Then we have the following convergence in distribution: 

\[ 1_{\{|\mathbb{G}_n^*| > 0\}} |\mathbb{T}_n^*|^{-1/2} \sum_{i \in \mathbb{T}_n^*} \left( f(\Delta_i) - P^*f(X_i) \right) \longrightarrow 1_{\{W \neq 0\}} \sigma G, \]

where $G$ is a Gaussian random variable with mean zero and variance 1 independent of $W$, 
which is defined by (8). 

Proof. Notice that $\sum_{i \in \mathbb{T}_n^*} (f(\Delta_i) - P^*f(X_i)) = M^*_{\tau_n(1)}(f) - M^*_{\tau_n(1)}(P^*f)$, $|\mathbb{T}_n^*| = M^*_{\tau_n(1)}(1)$ and 
that $1_{\{|\mathbb{G}_n^*| > 0\}}$ converges a.s. to $1_{\{W \neq 0\}}$. Then, to conclude, use the stable convergence of 
Theorem 5.2 and the fact that the marginals at time 1 converge since the limit is continuous. 

□ 

The next result gives that the fluctuations over each generation are asymptotically inde- 
pendent. 

Corollary 5.4. Let $(X_i, i \in \mathbb{T}^*)$ be a super-critical spatially homogeneous $P^*$-BMC on a GW 
tree. Let $F$ satisfy (i)-(vi). Let $d \geq 1$, $f_1, \ldots, f_d \in \mathcal{B}(\bar{S}^3)$ such that $P^*f_\ell^k$ exist and belong to 
$F$ for all $1 \leq \ell \leq d$ and $1 \leq k \leq 4$. Let $\sigma_\ell^2 = (\mu, P^*f_\ell^2 - (P^*f_\ell)^2)$ for $1 \leq \ell \leq d$. 
We set for $f \in \mathcal{B}(\bar{S}^3)$ 

\[ N_n(f) = |\mathbb{G}_n^*|^{-1/2} M_{\mathbb{G}_n^*}(f - P^*f). \]

Then we have the following convergence in distribution: 

\[ (N_n(f_1), \ldots, N_{n-d+1}(f_d)) \, 1_{\{|\mathbb{G}_n^*| > 0\}} \longrightarrow 1_{\{W \neq 0\}} (\sigma_1 G_1, \ldots, \sigma_d G_d), \]

where $G_1, \ldots, G_d$ are independent Gaussian random variables with mean zero and variance 1 
and are independent of $W$, which is defined by (8). 

Proof. Notice that for $n \geq k \geq 0$, 

iv„-.(/)i,l=:l>o, = .,(i)-Af, ..,(1) 

and $1_{\{|\mathbb{G}_{n-k}^*| > 0\}}$ converges a.s. to $1_{\{W \neq 0\}}$. To conclude, use the stable convergence of Theorem 
5.2 and that the increments of the Brownian motion are independent. □ 



The extension of the two previous Corollaries to vector-valued functions can be proved in 
a very similar way. 



6. Estimation and tests for the asymmetric auto-regressive model 

We consider the asymmetric auto-regressive model given in Section 1.1. Notice that the 
process $(X_i, i \in \mathbb{T})$ defined in Section 1.1, with the convention that $X_i = \partial$ if the cell $i$ is 
dead or non existing, is a spatially homogeneous BMC on a GW tree. We shall assume it is 
super-critical, that is $2p_{1,0} + p_1 + p_0 > 1$. 
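
The super-criticality condition can be checked numerically. The sketch below is only an illustration: the function names are hypothetical, and it assumes that $p_{1,0}$, $p_0$ and $p_1$ are the probabilities that a cell has two living daughters, only the new pole daughter alive, or only the old pole daughter alive. Under that assumption the number of living daughters has generating function $g(s) = q + (p_0 + p_1)s + p_{1,0}s^2$ with $q = 1 - p_{1,0} - p_0 - p_1$, and the extinction probability is the smallest fixed point of $g$ in $[0,1]$.

```python
def mean_offspring(p10: float, p0: float, p1: float) -> float:
    """Mean number of living daughters, m = 2*p10 + p0 + p1 (assumed meaning of the p's)."""
    return 2.0 * p10 + p0 + p1


def extinction_probability(p10: float, p0: float, p1: float) -> float:
    """Smallest fixed point in [0, 1] of g(s) = q + (p0 + p1)*s + p10*s**2."""
    q = 1.0 - p10 - p0 - p1  # probability that both daughters are dead
    if min(p10, p0, p1) < 0.0 or q < -1e-12:
        raise ValueError("probabilities must be non-negative and sum to at most 1")
    q = max(q, 0.0)
    if p10 == 0.0:
        # At most one daughter per cell: extinction is certain as soon as death can occur.
        return 1.0 if q > 0.0 else 0.0
    # g(s) = s has roots 1 and q / p10; the extinction probability is the smaller one.
    return min(1.0, q / p10)


def is_super_critical(p10: float, p0: float, p1: float) -> bool:
    """True when m > 1, i.e. the lineage survives with positive probability."""
    return mean_offspring(p10, p0, p1) > 1.0
```

In the super-critical case the value returned equals $1 - (m-1)/p_{1,0}$, since $q/p_{1,0} = 1 - (2p_{1,0} + p_0 + p_1 - 1)/p_{1,0}$.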

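We compute the maximum likelihood estimator (MLE) 

\[ \hat\theta^n = (\hat\alpha_0^n, \hat\beta_0^n, \hat\alpha_1^n, \hat\beta_1^n, \hat\alpha_0'^n, \hat\beta_0'^n, \hat\alpha_1'^n, \hat\beta_1'^n, \hat p_{1,0}^n, \hat p_0^n, \hat p_1^n) \]

of $\theta$ and $\hat\kappa^n = (\hat\sigma^n, \hat\rho^n, \hat\sigma_0^n, \hat\sigma_1^n)$ of $\kappa = (\sigma, \rho, \sigma_0, \sigma_1)$, based on the observation of a 
sub-tree $\mathbb{T}^*_{n+1}$. Let $\mathbb{T}_n^{1,0}$ be the set of cells in $\mathbb{T}_n^*$ with two living daughters, and $\mathbb{T}_n^0$ (resp. $\mathbb{T}_n^1$) be 
the set of cells of $\mathbb{T}_n^*$ with only the new (resp. old) pole daughter alive: 

\[ \mathbb{T}_n^{1,0} = \{i \in \mathbb{T}_n^*, \Delta_i \in S^3\}, \quad \mathbb{T}_n^0 = \{i \in \mathbb{T}_n^*, \Delta_i \in S^2 \times \{\partial\}\} \quad \text{and} \quad \mathbb{T}_n^1 = \{i \in \mathbb{T}_n^*, \Delta_i \in S \times \{\partial\} \times S\}. \]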
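It is elementary to get that for $\delta \in \{0, 1\}$, 

\[ \hat\alpha_\delta^n = \frac{|\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_i X_{i\delta} - \left( |\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_i \right) \left( |\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_{i\delta} \right)}{|\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_i^2 - \left( |\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_i \right)^2}, \quad \hat\beta_\delta^n = |\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_{i\delta} - \hat\alpha_\delta^n \, |\mathbb{T}_n^{1,0}|^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} X_i, \tag{17} \]

\[ \hat\alpha_\delta'^n = \frac{|\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_i X_{i\delta} - \left( |\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_i \right) \left( |\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_{i\delta} \right)}{|\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_i^2 - \left( |\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_i \right)^2}, \quad \hat\beta_\delta'^n = |\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_{i\delta} - \hat\alpha_\delta'^n \, |\mathbb{T}_n^{\delta}|^{-1} \sum_{i \in \mathbb{T}_n^{\delta}} X_i, \tag{18} \]

\[ \hat p_{1,0}^n = \frac{|\mathbb{T}_n^{1,0}|}{|\mathbb{T}_n^*|}, \quad \hat p_\delta^n = \frac{|\mathbb{T}_n^{\delta}|}{|\mathbb{T}_n^*|}, \tag{19} \]

and 

\[ (\hat\sigma^n)^2 = \frac{1}{2|\mathbb{T}_n^{1,0}|} \sum_{i \in \mathbb{T}_n^{1,0}} (\hat\varepsilon_{i0}^2 + \hat\varepsilon_{i1}^2), \quad \hat\rho^n = \frac{1}{(\hat\sigma^n)^2 |\mathbb{T}_n^{1,0}|} \sum_{i \in \mathbb{T}_n^{1,0}} \hat\varepsilon_{i0} \hat\varepsilon_{i1}, \quad (\hat\sigma_\delta^n)^2 = \frac{1}{|\mathbb{T}_n^{\delta}|} \sum_{i \in \mathbb{T}_n^{\delta}} \hat\varepsilon_{i\delta}'^2. \]

The residues are 

\[ \hat\varepsilon_{i\delta} = X_{i\delta} - \hat\alpha_\delta^n X_i - \hat\beta_\delta^n \text{ for } i \in \mathbb{T}_n^{1,0}, \quad \text{and} \quad \hat\varepsilon_{i\delta}' = X_{i\delta} - \hat\alpha_\delta'^n X_i - \hat\beta_\delta'^n \text{ for } i \in \mathbb{T}_n^{\delta}. \]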
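
As an illustration of how such regression estimators and residues can be computed from data, here is a minimal sketch. It assumes that the estimators of the slope and intercept take the usual least-squares form for a linear Gaussian regression of the daughter's value on the mother's value over a given set of cells; the function names `fit_line` and `residuals` are hypothetical.

```python
from typing import List, Sequence, Tuple


def fit_line(mothers: Sequence[float], daughters: Sequence[float]) -> Tuple[float, float]:
    """Least-squares slope and intercept of daughter values against mother values."""
    n = len(mothers)
    if n != len(daughters) or n < 2:
        raise ValueError("need at least two (mother, daughter) pairs of equal length")
    mean_x = sum(mothers) / n
    mean_y = sum(daughters) / n
    # Empirical variance of the mothers and covariance with the daughters.
    var_x = sum(x * x for x in mothers) / n - mean_x ** 2
    cov_xy = sum(x * y for x, y in zip(mothers, daughters)) / n - mean_x * mean_y
    if var_x <= 0.0:
        raise ValueError("mother values must not all be equal")
    alpha_hat = cov_xy / var_x
    beta_hat = mean_y - alpha_hat * mean_x
    return alpha_hat, beta_hat


def residuals(mothers: Sequence[float], daughters: Sequence[float],
              alpha_hat: float, beta_hat: float) -> List[float]:
    """Residues X_daughter - alpha_hat * X_mother - beta_hat, one per pair."""
    return [y - alpha_hat * x - beta_hat for x, y in zip(mothers, daughters)]
```

Applied to the pairs $(X_i, X_{i\delta})$ with $i$ ranging over the cells with two living daughters, this gives one estimate per daughter type; applied to the cells with a single living daughter, it gives the primed estimates.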

Notice that those MLE are based on polynomial functions of the observations. In order to 
use the results of Sections 3 and 5, we first show that the set of continuous and polynomially 
growing functions satisfies properties (i) to (v) of Section 2. The set of continuous and 
polynomially growing functions $C_{pol}(\mathbb{R})$ is defined as the set of continuous real functions $f$ 
defined on $\mathbb{R}$ s.t. there exist $m \geq 0$ and $c \geq 0$ such that for all $x \in \mathbb{R}$, $|f(x)| \leq c(1 + |x|^m)$. It is 
easy to check that $C_{pol}(\mathbb{R})$ satisfies conditions (i)-(iii). To check properties (iv) and (v), we 
notice that the auxiliary Markov chain $Y = (Y_n, n \in \mathbb{N})$ can be written in the following way: 

\[ Y_{n+1} = a_{n+1} Y_n + b_{n+1}, \]

with $b_n = b'_n + s_n e_n$, where $((a_n, b'_n, s_n), n \geq 1)$ is a sequence of independent identically 
distributed random variables, whose common distribution is given by, for $\delta \in \{0, 1\}$, 

\[ P(a_1 = \alpha_\delta, b'_1 = \beta_\delta, s_1 = \sigma) = \frac{p_{1,0}}{m} \quad \text{and} \quad P(a_1 = \alpha'_\delta, b'_1 = \beta'_\delta, s_1 = \sigma_\delta) = \frac{p_\delta}{m}, \tag{20} \]

$(e_n, n \geq 1)$ is a sequence of independent $\mathcal{N}(0, 1)$ random variables, and is independent of 
$((a_n, b'_n, s_n), n \geq 1)$, and both sequences are independent of $Y_0$. Notice that $Y_n$ is distributed 
as $Z_n = a_1 a_2 \cdots a_{n-1} a_n Y_0 + \sum_{k=1}^n a_1 a_2 \cdots a_{k-1} b_k$. Since $|a_k| \leq \max(|\alpha_0|, |\alpha_1|, |\alpha'_0|, |\alpha'_1|) < 1$ 
for all $k \in \mathbb{N}^*$, we get that the sequence $(Z_n, n \in \mathbb{N})$ converges a.s. to a limit $Z$. This implies 
that $Y$ converges in distribution to $Z$. Following the proof of Lemma 26 in [9], we get that 
$C_{pol}(\mathbb{R})$ fulfills properties (iv) and (v), with $\mu$ the distribution of $Z$. 
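
The recursion for the auxiliary chain can be simulated directly. The following sketch is illustrative only: the container `ARParams` and the function names are hypothetical, and it assumes that the common law of $(a_1, b'_1, s_1)$ puts mass $p_{1,0}/m$ on $(\alpha_\delta, \beta_\delta, \sigma)$ and mass $p_\delta/m$ on $(\alpha'_\delta, \beta'_\delta, \sigma_\delta)$ for $\delta \in \{0,1\}$.

```python
import random
from dataclasses import dataclass
from typing import Tuple


@dataclass
class ARParams:
    """Hypothetical container for the model parameters (names are illustrative)."""
    alpha: Tuple[float, float]     # (alpha_0, alpha_1), sister alive
    beta: Tuple[float, float]      # (beta_0, beta_1), sister alive
    alpha_p: Tuple[float, float]   # (alpha'_0, alpha'_1), sister dead
    beta_p: Tuple[float, float]    # (beta'_0, beta'_1), sister dead
    sigma: float                   # noise scale when both daughters live
    sigma_d: Tuple[float, float]   # (sigma_0, sigma_1), noise scale for a lone daughter
    p10: float
    p0: float
    p1: float


def sample_step(rng: random.Random, prm: ARParams) -> Tuple[float, float, float]:
    """Draw (a, b', s) from the assumed four-point law."""
    cases = [
        (prm.alpha[0], prm.beta[0], prm.sigma),
        (prm.alpha[1], prm.beta[1], prm.sigma),
        (prm.alpha_p[0], prm.beta_p[0], prm.sigma_d[0]),
        (prm.alpha_p[1], prm.beta_p[1], prm.sigma_d[1]),
    ]
    # Weights are proportional to p/m; random.choices normalises them.
    weights = [prm.p10, prm.p10, prm.p0, prm.p1]
    return rng.choices(cases, weights=weights, k=1)[0]


def simulate_chain(prm: ARParams, y0: float, n_steps: int, seed: int = 0) -> float:
    """Return Y_n after n_steps of Y_{k+1} = a_{k+1} Y_k + b'_{k+1} + s_{k+1} e_{k+1}."""
    rng = random.Random(seed)
    y = y0
    for _ in range(n_steps):
        a, b_prime, s = sample_step(rng, prm)
        y = a * y + b_prime + s * rng.gauss(0.0, 1.0)
    return y
```

Running `simulate_chain` for many seeds with a large `n_steps` gives an empirical approximation of the law of $Z$; because all slopes have modulus less than 1, the influence of `y0` vanishes geometrically.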

Proposition 6.1. Assume that the distribution of the ancestor $X_\emptyset$ has finite moments of all 
orders. Then $(1_{\{|\mathbb{G}_n^*| > 0\}} \hat\theta^n, n \geq 1)$ and $(1_{\{|\mathbb{G}_n^*| > 0\}} \hat\kappa^n, n \geq 1)$ converge in probability respectively 
to $1_{\{W \neq 0\}} \theta$ and $1_{\{W \neq 0\}} \kappa$, where $W$ is defined by (8). 

Proof. The hypothesis on the distribution of $X_\emptyset$ implies that $C_{pol}(\mathbb{R})$ fulfills (vi). The result 
is then a direct consequence of Theorem 3.4. □ 

Remark 6.2. Using similar arguments as in Proposition 30 and 34 of [9], it is easy to deduce 
from (13) and the proofs of Theorem 14 and Proposition 28 of [9] that the convergences in 
Proposition 6.1 hold a.s., that is the MLEs $\hat\theta^n$ and $\hat\kappa^n$ are strongly consistent. 

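From the definition of $Z_n$, we deduce that in distribution $Z = a_1 Z' + b_1$, where $Z'$ is 
distributed as $Z$ and is independent of $(a_1, b_1)$ (see (20) for the distribution of $(a_1, b_1)$). This 
equality in distribution entails that 

\[ \mu_1 = E[Z] = \frac{\bar\beta}{1 - \bar\alpha} \quad \text{and} \quad \mu_2 = E[Z^2] = \frac{2 \, \overline{\alpha\beta} \, \bar\beta / (1 - \bar\alpha) + \overline{\beta^2} + \overline{\sigma^2}}{1 - \overline{\alpha^2}}, \tag{21} \]

where $\bar\alpha = E[a_1]$, $\overline{\alpha^2} = E[a_1^2]$, $\bar\beta = E[b'_1]$, $\overline{\beta^2} = E[b_1'^2]$, $\overline{\alpha\beta} = E[a_1 b'_1]$ and $\overline{\sigma^2} = E[s_1^2]$. 

We can now state one of the main results of this section. 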
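
The two moments follow from taking expectations in $Z = a_1 Z' + b_1$ and in its square, using $b_1 = b'_1 + s_1 e_1$ with $e_1$ centered, of variance 1 and independent of $(a_1, b'_1, s_1)$: this gives $E[Z] = \bar\alpha E[Z] + \bar\beta$ and $E[Z^2] = \overline{\alpha^2} E[Z^2] + 2\,\overline{\alpha\beta}\,E[Z] + \overline{\beta^2} + \overline{\sigma^2}$. The sketch below solves these two linear equations for an arbitrary finite law of $(a_1, b'_1, s_1)$; the function name and the input format are hypothetical.

```python
from typing import Iterable, Tuple


def stationary_moments(cases: Iterable[Tuple[float, float, float, float]]) -> Tuple[float, float]:
    """First and second moments of Z solving Z = a Z' + b' + s e in distribution.

    `cases` lists (probability, a, b_prime, s) for each atom of the law of (a_1, b'_1, s_1).
    """
    atoms = list(cases)
    if abs(sum(p for p, _, _, _ in atoms) - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    a_bar = sum(p * a for p, a, _, _ in atoms)
    a2_bar = sum(p * a * a for p, a, _, _ in atoms)
    b_bar = sum(p * b for p, _, b, _ in atoms)
    b2_bar = sum(p * b * b for p, _, b, _ in atoms)
    ab_bar = sum(p * a * b for p, a, b, _ in atoms)
    s2_bar = sum(p * s * s for p, _, _, s in atoms)
    if a2_bar >= 1.0:
        raise ValueError("E[a^2] must be smaller than 1")
    mu1 = b_bar / (1.0 - a_bar)                                   # E[Z]
    mu2 = (2.0 * ab_bar * mu1 + b2_bar + s2_bar) / (1.0 - a2_bar)  # E[Z^2]
    return mu1, mu2
```

For a single atom with $a = 0.5$, $b' = 1$, $s = 1$, the chain is a standard AR(1) process and the function returns mean 2 and second moment $4 + 1/(1 - 0.25)$.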



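Proposition 6.3. Assume that the distribution of the ancestor $X_\emptyset$ has finite moments of 
all orders. Then $1_{\{|\mathbb{G}_n^*| > 0\}} |\mathbb{T}_n^*|^{1/2} (\hat\theta^n - \theta)$ converges in law to $1_{\{W \neq 0\}} G_{11}$, where $G_{11}$ is an 
11-dimensional vector, independent of $W$ defined by (8), with law $\mathcal{N}(0, \Sigma)$ where 

\[ \Sigma = \begin{pmatrix}
\sigma^2 K / p_{1,0} & \rho \sigma^2 K / p_{1,0} & 0 & 0 & 0 \\
\rho \sigma^2 K / p_{1,0} & \sigma^2 K / p_{1,0} & 0 & 0 & 0 \\
0 & 0 & \sigma_0^2 K / p_0 & 0 & 0 \\
0 & 0 & 0 & \sigma_1^2 K / p_1 & 0 \\
0 & 0 & 0 & 0 & \Gamma
\end{pmatrix}, \]

with 

\[ K = \begin{pmatrix} \mu_2 & \mu_1 \\ \mu_1 & 1 \end{pmatrix}^{-1} \quad \text{and} \quad \Gamma = \begin{pmatrix}
p_{1,0}(1 - p_{1,0}) & -p_0 p_{1,0} & -p_1 p_{1,0} \\
-p_0 p_{1,0} & p_0 (1 - p_0) & -p_0 p_1 \\
-p_1 p_{1,0} & -p_0 p_1 & p_1 (1 - p_1)
\end{pmatrix}. \]

The proof of Proposition 6.3 relies on Theorem 5.2 and mimics the proof of Proposition 
33 of [9]. It is left to the reader. 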



Remark 6.4. Proposition 6.3 deals with the asymptotic normality of the MLE of $\theta$ based 
on the observation of the sub-tree $\mathbb{T}^*_{n+1}$. If $L(X_i, i \in \mathbb{T}^*_{n+1}; \theta)$ denotes the corresponding 
log-likelihood function for $\theta$, the Fisher information, say $I_{n+1}$, is given by 

\[ I_{n+1} = -E\left[ \frac{\partial^2 L(X_i, i \in \mathbb{T}^*_{n+1}; \theta)}{\partial\theta \, \partial\theta'} \right]. \]

Using Theorem 3.4, one can check that $\lim_{n \to \infty} I_{n+1} / E[|\mathbb{T}_n^*|] = \Sigma^{-1}$. This is the analogue 
of the well-known asymptotic efficiency of the MLE for a parametric sample of i.i.d. random 
variables. 

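Let $\theta_{1,0}$ (resp. $\hat\theta_{1,0}^n$) stand for $(\alpha_0, \beta_0, \alpha_1, \beta_1)$ (resp. $(\hat\alpha_0^n, \hat\beta_0^n, \hat\alpha_1^n, \hat\beta_1^n)$). 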

Remark 6.5. Proposition 6.3 is quite similar to Proposition 33 in [9]. One of the main 
differences comes from the factor $p_{1,0}^{-1}$ in front of the matrix $K$ in the asymptotic covariance 
matrix for the estimation of $\theta_{1,0}$ with $\hat\theta_{1,0}^n$. As a matter of fact, this factor comes from the 
normalization by $|\mathbb{T}_n^*|^{1/2}$, the number of living cells up to generation $n$, whereas this estimation is 
related to the cells with two living daughters, which would induce a normalization by $|\mathbb{T}_n^{1,0}|^{1/2}$. 
Since $1_{\{|\mathbb{T}_n^*| > 0\}} |\mathbb{T}_n^{1,0}| / |\mathbb{T}_n^*|$ converges in probability to $p_{1,0} 1_{\{W \neq 0\}}$, such a normalization would 
suppress the factor $p_{1,0}^{-1}$, see the following Corollary. 
Corollary 6.6. Assume that the distribution of the ancestor $X_\emptyset$ has finite moments of all 
orders. Then $1_{\{|\mathbb{G}_n^*| > 0\}} |\mathbb{T}_n^{1,0}|^{1/2} (\hat\theta_{1,0}^n - \theta_{1,0})$ converges in law to $1_{\{W \neq 0\}} G_4$, where $G_4$ is a 
4-dimensional vector, independent of $W$ defined by (8), with law $\mathcal{N}(0, \Sigma')$ where 

\[ \Sigma' = \sigma^2 \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix}. \]

This result is formally the same as Proposition 33 of [9], but one should notice that $\mu_1$ and 
$\mu_2$ are not defined the same way as in [9], since here they also depend on the parameters 
concerning cells with dead sisters, see equations (2), (3), (20) and (21). 

In order to detect cellular aging, see [9] in the case of no death ($m = 2$), we consider the 
null hypothesis $H_0 = \{(\alpha_0, \beta_0) = (\alpha_1, \beta_1)\}$, which corresponds to no aging, and its alternative 
$H_1 = \{(\alpha_0, \beta_0) \neq (\alpha_1, \beta_1)\}$. Notice that $\theta \mapsto \mu_1(\theta)$ and $(\theta, \kappa) \mapsto \mu_2(\theta, \kappa)$ given by (21) 
are continuous functions defined respectively on $\Theta = ((-1, 1) \times \mathbb{R})^4 \times ([0, 1]^3 \setminus \{0, 0, 0\})$ and 
$\Theta \times ]0, +\infty[^3$. We set $\hat\mu_1^n = \mu_1(\hat\theta^n)$ and $\hat\mu_2^n = \mu_2(\hat\theta^n, \hat\kappa^n)$. Proposition 6.7 allows to build a test 
for $H_0$ against $H_1$. Its proof, which is left to the reader, follows the proof of Proposition 35 of 
[9] and uses Corollary 6.6, the value of the extinction probability $\eta = P(W = 0) = 1 - (m-1)/p_{1,0}$, 
where $W$ is defined by (8), and Remark 6.2. 

Proposition 6.7. Let $U$ and $V$ be two independent random variables, with $U$ distributed as 
a $\chi^2$ with two degrees of freedom and $V$ a Bernoulli random variable with parameter $1 - \eta$. 

Assume that the distribution of the ancestor $X_\emptyset$ has finite moments of all orders and define 
the test statistic 

Then, the statistics $1_{\{|\mathbb{G}_n^*| > 0\}} \zeta_n$ converges under $H_0$ in distribution to $UV$, and under $H_1$ a.s. 
to 0 on $\{V = 0\}$ and $+\infty$ on $\{V = 1\}$. 
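
To make the use of such a test concrete, here is a minimal sketch of how a statistic with a $\chi^2$ limit with two degrees of freedom can be turned into a p-value on the event of non-extinction. The quadratic form in `wald_statistic` is an assumption, not the formula of the proposition: it is the usual Wald-type construction one would obtain if the estimator of $(\alpha_0, \beta_0, \alpha_1, \beta_1)$, normalized by the square root of the number of cells with two living daughters, had limiting covariance $\sigma^2 \begin{pmatrix} K & \rho K \\ \rho K & K \end{pmatrix}$ with $K^{-1} = \begin{pmatrix} \mu_2 & \mu_1 \\ \mu_1 & 1 \end{pmatrix}$. All function and argument names are hypothetical.

```python
import math


def chi2_two_dof_sf(x: float) -> float:
    """P(U > x) for U chi-square with two degrees of freedom, i.e. exp(-x/2) for x >= 0."""
    return math.exp(-0.5 * x) if x > 0.0 else 1.0


def wald_statistic(n_pairs: int, d_alpha: float, d_beta: float,
                   mu1: float, mu2: float, sigma2: float, rho: float) -> float:
    """Assumed Wald-type statistic for H0: (alpha_0, beta_0) = (alpha_1, beta_1).

    d_alpha, d_beta: differences of the estimated slopes and intercepts;
    mu1, mu2, sigma2, rho: plug-in estimates of the moments, variance and correlation.
    """
    if sigma2 <= 0.0 or rho >= 1.0:
        raise ValueError("need sigma2 > 0 and rho < 1")
    # Quadratic form (d_alpha, d_beta) K^{-1} (d_alpha, d_beta)'.
    quad = d_alpha ** 2 * mu2 + 2.0 * d_alpha * d_beta * mu1 + d_beta ** 2
    return n_pairs * quad / (2.0 * sigma2 * (1.0 - rho))


def reject_h0(statistic: float, level: float = 0.05) -> bool:
    """Reject H0 when the chi-square(2) p-value falls below the level."""
    return chi2_two_dof_sf(statistic) < level
```

At level 0.05 the rejection threshold is $2 \ln 20 \approx 5.99$, the 95% quantile of the $\chi^2$ distribution with two degrees of freedom.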

Remark 6.8. Let us assume that: 

• Death occurs, that is $m \in (1, 2)$. 

• There is no difference for the marginal distribution of a daughter according to whether her 
sister is dead or alive; that is $\alpha'_\delta = \alpha_\delta$ and $\beta'_\delta = \beta_\delta$ for $\delta \in \{0, 1\}$. 

• For simplicity, the death probability is symmetric, that is $p_0 = p_1$. 

If one uses the statistics given by Proposition 33 in [9] with all the available data, that is if 
one uses 

• Formula ([HD and ([HD with Tn'° replaced by T^'" U T^; 

• The variance estimator: 

\[ (\hat\sigma^n)^2 = \frac{1}{|\mathbb{T}^*_{n+1}| - 1} \left( \sum_{i \in \mathbb{T}_n^{1,0}} (\hat\varepsilon_{i0}^2 + \hat\varepsilon_{i1}^2) + \sum_{i \in \mathbb{T}_n^0} \hat\varepsilon_{i0}^2 + \sum_{i \in \mathbb{T}_n^1} \hat\varepsilon_{i1}^2 \right); \]

(Notice that we divide by $|\mathbb{T}^*_{n+1}| - 1$ as this is equal to the total number of data: 
$2|\mathbb{T}_n^{1,0}| + |\mathbb{T}_n^0| + |\mathbb{T}_n^1|$.) 

• Keep the same estimation of the correlation: $\hat\rho^n = \left( (\hat\sigma^n)^2 |\mathbb{T}_n^{1,0}| \right)^{-1} \sum_{i \in \mathbb{T}_n^{1,0}} \hat\varepsilon_{i0} \hat\varepsilon_{i1}$, 

then one can check that, as $n$ goes to infinity, $1_{\{|\mathbb{G}_n^*| > 0\}} |\mathbb{T}_n^*|^{1/2} (\hat\theta_{1,0}^n - \theta_{1,0})$ converges in distribution 
to $1_{\{W \neq 0\}} G$, where $G$ is a centered Gaussian vector with covariance matrix 

where $K$ is as in Proposition 6.3 and $G$ is independent of $W$, which is defined by (8). Then, 
it is not difficult to check that the statistics proposed by Guyon in Proposition 35 of [9] 
converges under $H_0$ towards $cUV$, with $U$ and $V$ as in Proposition 6.7 and 

As $\rho \in [-1, 1]$, $p_{1,0} + p_1 > 1/2$ (because $m > 1$ and $p_0 = p_1$) and $2p_1 + p_{1,0} \leq 1$, one can check 
that $c > 1$. In particular, using the test statistic designed for cells with no death to data of 
cells with death leads to a non-conservative test. 